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NOVEL PROBES FOR THE DETECTION OF MYCOBACTERIA 

The present Invention relates to novel probes and to mixtures of such probes, in addition to 
the design, constiructlon and uise of such novel probes or mbctures thereof for detecting a 

5 target sequence of one or more nfiycobacteria, virtiich probes are capable of detecting such 
organlsm(s) optionally present in a test sample, e.g. sputum, laryngeal swabs, gastric lavage, 
bronchial washings, biopsies, aspirates, expectorates, body fluids (spinal, pleural, pericardial, 
synovial, btdod, pus, bone marrow), urine, tissue sections as well as food samples, soil, air 
and water samples and cultures thereof. The invention relates in particular to novel probes 

1 0 and mixtures thereof for detecting the presisnce of one or more mycobacteria of the 

Mycobacterium tuberculosis Complex (MTC) and for detecting the presence of one or more 
mycobacteria other than rtiycobacteria of the Mycobactierlum tuberculosis Corhplex (MOTT). 
The Invention further relates to diagnostic kits corhprising one or more of such probes. The 
probes of the present invention are suiprisingiy able to penetrate the cell wall of the 

15 rhycobacteria, thus making poissible the development of fast an easy-performed In situ 
protocolis. 

BACKGROUND OF THE INVENTION 

20 Tuberculosis is a very life-threaitening and highly epidemic disease which is caused by 
infection with Mycobacterium tuberculosis. Tuberculosis is presently the predominant 
infectious cause of morbidity and mortelity world-wide, and is estimated to kill abdiit three 
million people annually. WHO estimates that the annual number of new cases of tuberculosis 
will Increase from 7.5 mlllioh In 1990 to 10.2 mlliion in 2000, an escalation that will result In 

25 approximately 90 million new cases during this decade. It is furthemiore estimated that 30 
million people will die from tuberculosis during the 1990s, which equals one quarter of 
|3reveinteble deaths among adults. 

The prevalence of tuberculosis has been very high in the poorer parts of the worid such as 
30 Asia, Africa arid South-America, but In recent years an increase has also been observed in 
industrialised countries. This appears to be due to an interaction of various factors including 
la. patterns of migratton, pooriy organised tuberculosis programmes arid nutrition probienns: 
. Furthermore, a serious threat will arise frorti the emergence of new strains that are drug 
resistent or worse, multi-drug resistent 

35 

Mycobacteria are often divided into tuberculous mycobacteria, i.e. mycobacteria of the 
Mycobacterium tuberculosis Complex (MTC), and non-tubercuious mycobacteria, i.e. 
mycobacteria other than those of the Mycobacterium tuberculosis Complex (MOTT). The MTC 
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group comprises apart frorn M. tuberculdsls. M. bovis, M. africanum and M. microti. 
Mycobacteria of the MOTT group are not nonnally pathogenic to healthy individuals but may 
causie disease in ihnmunocompromlsed individuals. e.g. individuals infected with HIV. Clinical 
relevant mycobacteria of the MOTT group are in particular M. avium. M. intracellulare. M. 
5 kansasii arid M. gordonae, but also M. scrisfulaceum, M. xenopi and M. fortuitum. 

M, avium and M. intracellulare together with M. paratuberculosis arid M. lepraemurium 
constitute the Mycobacterium avium Complex (MAC). Extended with M, scrofulaceum, the 
group is named Mycobacterium avium -intracellulare -scrofulaceum Complex (MAIS). 

10 

It is well-known that treatment of mycobacterial infections with antibiotics may lead to the 
emergience of drug resistant strains. Many antibiotic drugs excert their effects by interfering 
with protein synthesis or with transcription. Studies of the molecular mechanisms underlying 
certain antibiotic resistance phenotypes in clinical mycobacterium isolates have revealed 

16 hiutations in rRNA genes. The development of resistance because of mutation(s) located in 
the riRNA gene is likely to occur since slow-growing mycobacteria have only a single rRNA 
operon. All mycobacteria populations comprise a minority of drug resistant riiutants that have 
arisen by spontaneous mutation. These mutated mycobacteria do nomnally not survive 
particularly well, but, when single-drug therapy is offered as treatment, the drug susceptible 

20 bacteria are killed, and only the resistant mutants will survive and rfiultiply, and, thus at some 
point, constitute the majority of the mycobacterial population. The selection of drug resistant 
bacteria due to inadequate drug therapy leads to a state of so-called "acquired drug- 
resistance". In contrast, "pririiary drug-resistance" is used to characterise a situation where 
drug-resistant mycobacteria can be isolated from a patient who has never been treated for 

25 niycobacterial infection, and lias become infected with drug-resistant mycobacteria from an 
individual suffering froni infection with an acquired drug resistant bacterium. 

Today, dnjg-fesistance is determined primarily phenotypically by culturing clinical samples, in 
which presence of mycobacteria have been demonstrated, in the presence of the Individual 
30 drugs. This is unfortunately a very slow and time-consuming procedure as the result of the 
drug-resistance studies depends on the growth rate of the mycobacteria, which are well- 
known to be slow. Thus, the result is not available until after several weeks. 

Although the incidence of drug-resistance is, at least not yet very common, it is nevertheless 
35 very important that resistant strains are identified and eradicated. Therefore, it is of major 
importance to find a reliable and rapidly perfomied method of diagnosing drug-resistance. 



Presently, the detection of mycobacteria by microscopy is the rhost prevalent method for 
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diagnosis. The sample (e.g. an expectorate) is stained for the presence of acid-fast bacilli 
using e.g. Ziehl-Neelsen staining. However, staining for acid-fast bacilli does not provide the 
necessary infbnnatlon about the type of inlection. only whether acid fast bacilli are present in 
the sample, and this is in Itself not sufficient infonnation for establishing a diagnosis. Samples 
positive for add fast bacilli, may subsequently be cultured in order to be able to perfbnn 
species identification. 

Since Ziehl-Neelsen staining cannot be used to determine whether the infection is caused by 
mycobacteria of the MTC group or mycobacteria other than mycobacteria of the MTC group, a 
ix)sitive staining frequently leads to very costiy isolation of all the patients with suspected 
tuberculosis infection as well as treatment with medicaments to which the patient may not 
ev^n respond. 

Since the sensitivity of acid fast staining is only approximately 10*-10^ per ml smear negative 
samples should also be cultured aS culture-based tests are sensitive, and as it may be 
possible to detect 10-100 organisms per sample, but the result is not available before up to 8 
weeks of culturing. Lilcewise, information about drug susceptibility is not available until after 1- 
3 weetcs of further testing. 

Different solid or liquid media (Loewenstein Jensen slants and Dubos broth) have traditionally 
been used for culturing of mycobacteria-containing samples. Newer media include ESP Myco 
Culture System (Difco), MB/BacT (Organon Teknika), BacTec (Becton Dickinson) and MGIT 
(Becton Dickinson). These test media are based on colourmetric or fluorometric detection of 
carbon dioxide or oxygen produced by mycobacterial metabolism, and adapted to automated 
systems for large scale testing. 

Species identification is presentiy canied out following culturing using traditional biocherriical 
methods or probe hybridisation assays (e.g. AccuProbe by Gen-Probe Inc., USA). There is, 
therefore, an increasing need for nieans allowing a more rapid distinction between 
mycobacteria of the MTC group and mycobacteria ottier ttian those of Uie MTC group, and for 
furtiler species identificatton of tifiose especially mycobacteria other ttian those of the MTC 
group. 

A number of new attempts to replace the culture-based methods relies on molecular 
amplification technology. Several metiiods have emerged, among them the polymerase chain 
reaction (PGR), ttie ligase chain reaction and transcription mediated amplification. The basic 
principle of amplification methods is that a specific nucleic acid sequence of the mycobacteria 
is amplified to increase tiie copy number of tiie specific sequence to a level where the 
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amplicon may be detectable. In principle, the methods offers the possibility of detecting only 
one target sequence, thus, in principle, making detection of mycobacteria present at low levels 
possible. However, it has become dear that the target amplification methods cannot replace 
culture-based methods as only samples which are positive by staining for acid fast bacilli 
5 (AFB) give a satisfactory sensitivity. Furthermore, specific problems exist for each method. 
The PGR method may give false negative results due to the presence of inhibitors such as 
haenruaglobin. Another problem arises froix) cross-contamination of negative specimens and/or 
reagents with amplified nucleic acid present In the laboratory environment leading to false 
positive results. A disadvantage is that costly reagents are needed for performing these tests. 
10 Furthermore, specialised instrumentation is required, making these tests mainly useful in large 
specialised laboratories, and generally not applicable in smaller clinical laboratories. 

Nucleic acid probes for detecting rRNA of mycobacteria have been described In for example 
US 5 547 842, EP-A 0 572 120 and US 5 422 242. 

15 

Considering the perspective and impact the disease has, the development of rapid and 
preferably easy-performed and further econorriic feasible diagnostic detection tests are of 
utmost importance and would be a very valuable tool In the fight against the spread of 
tuberculosis. 

20 

Peptide nucleic acids are pseudb-peptides with DNA-binding capability. The compounds were 
first reported in the early nineties in connection with a series of attempts to design nucleotide 
analogues capable of hybridising, in a sequence-specific fashion, to DNA and RNA, of. WO 
92/20702. 

25 

Hybridisation of peptide nucleic acid probes to DNA and to RNA has been shown to obey the 
Watson-Crick base pairing rules, and peptide nudlBic acid probes have been found to 
hybridise to a DNA or a RNA taiiget with higher affinity and specificity than the nucleic acid 
counterparts. These properties are ascribed to the uncharged, as opposed to the charged. 

30 structure of the peptide nucleic add and DNA or RNA backbones, respectively, and to the high 
confomiatidnal flexibility of the peptide nudeic add molecules. These features - together with 
the documented stability of peptide nucleic acid towards a variety of naturally occumng nuc- 
leases and proteases that usually degrade DNA, RNA or proteins - invite for use of peptide 
nucleic acid probes as antisense therapeutic agents and opens potentially important 

35 applications in diagnostics. 

SUMMARY OF THE INVENTION 
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The present invention relates to novel peptide nucleic acid probes and to mixtures of such 
probes for detecting a target sequence of one or more mycobacteria optionally present in a 
sample. In accordance with claim 1. the probes are directed to target sequences of 
mycobacterial rRNA, genomic sequences corresponding to said rRIMA (rDNA) and precursor 
5 rRNA. rRNA is present in a high number of copies in each cell, and is hence a well suited 
target. The probes are. as defined in claim 2, suitably directed to target sequences of 
mycobacterial rDNA, precursor riRNA, or 23S, 16S or 5S rRNA. 

Thus, in a first aspect, the invention features a hybridisation assay probe and a mixture of 
10 such probes for detecting a target sequence of one or more mycobacteria In accordance with 
claim 1 and 2. Under appropriate stringency conditions, Such probes should not to any 
significant degree cross-react with ribosomal nucleic acid from other not relevant organisms, 
present in the test sample, in particular other mycobacteria. Cross-reactivity to organisms that, 
are unlikely to be present in the sample may not be of importance. In in situ assays implying 
15 examinatidn by micrdscdpy, it is further possible to distinguish mycobacteria from other 
bacteria based on 0ie morphology of these bacilli. 

the invention also relates to peptide nucleic acid probes in accordance with claim 3 for 
obtaining a target sequence and in accordance with claim 4 for obtaining a probe. 

20 

In another aspect, the Invention relates to novel peptide nucleic acid probes for detecting a 
target sequence of one or more mycobacteria of the MTC group, and one or more 
mycobacteria other than mycobacteria of the MTC group, which probes comprise from 6 to 30 
polymerised peptide nucleic acid moieties (claim 5). Suitable probes of formula (I) are claimed 
25 in claim 6. 

Claims 7 to 10 and 15 to 24 relate to probes or mixtures of such probes for detecting a target 
sequence of one or more mycobacteria of the MTC group. Claims 11 to 13 and 16 to 24 relate 
to probes or mbctures of such probes for detecting a target sequence of one or more 
3d mycobacteria other than mycobacteria of the MTC group (MOTT group). Claim 14 relates 
specifically to probes for detecting drug resistant mycobacteria. Claims 25 to 27 relate to the 
use of such probes or mbctures thereof. 

In accordance with claims 28 to 34, the present invention also relates to a method for 
35 detecting the presence of mycobacteria. 

In yet another aspect, the present invention relates to a kit (claim 35 and 36) comprising at 
least one peptide nucleic acid probe as defined in claims 1 to 24. 
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Mycobacteria are characterised by a complex cell wall which contains myollc acids, complex 
waxes and unique glycolipids. It Is generally recognised by those skilled in the art that this wall 
provides rhycobacteria with extreme resistance to chemical and physical stress as compared 
5 to other bacteria, and, accordingly, makes them very difficult to penetrate and lyse. The low 
permeability of the cell wall is considered to be the main reason tor the fact that only very few 
drugs are effective in the treatment of tuberculosis and other mycobacterial infections. As 
described in US 5 682 985, the wall appears further to prevent penetration by nucleic add 
probes. Even with short probes (shorter thian 30 nucleic acids), specific staining is low or often 
10 non-existent. Protocols that allow DNA probes to be used for In situ hybridisation to 

mycobacterial species are described in US 5 682 986. However, these protocols require 
dewaxing of the mycobacterial cell wall with xylene and further enzymatic treatment prior to 
the hybridisation steja in order to make the mycobacterial cell wall permeable to the DNA 
probes. 

15 

The problems set forth above have surprisingly been confipletely solved by the use of peptide 
nucleic acid probes. It has, surprisingly, been found that the peptide nucleic acid probes are 
able to penetrate the ceil wall of the mycobacteria, and furthermore that this is taking place 
rapidly. The person skilled in the art would arrive at the convictton that it would be necessary 

20 to heavily treat the mycobacteria before hybridisation is can-led out. Thus, based on the 
available prior art, there is a strong prejudice against carrying out hybridisation without prior 
destruction of the mycobacterial cell wall. The inventors have shown that this is indeed and 
unexpectedly possible. It has been demonstrated that the probes of the present invention are 
able to hybridise to mycobacterial precursor rRNA and rRNA without harsh treatment of the 

25 mycobacterial cells, thus avoiding a risk of interiering with the morphology of the cells. Using 
the present probes, specific and easy detection and, subsequently, diagnosis of tuberculosis 
and other mycobacterial infections are thus possible. 

BRIEF DESCRIPTION OF THE FIGURES 

30 

Alignments of rDNA sequences of M. tuberculosis (as a representative of the MTC group) and 
important closely related species thereto, including M. avium (as a represeritative of the 
mycobacteria other than those of the MTC group) and important closely related species 
thereto for the 23S. 16S and/or 5S rRNA genes have been made (Figures 1 A-1J, 2A-2D, 3, 
35 4A-4L and 5A-B). The alignment for M. bovis and M. intracellulare are partly based on public 
available sequences and partly on sequences obtained by sequencing performed at DAKO 
A/S. 
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Alignment for theMTC group (23S rDNA) 

Figures 1A-1J show alignments of 23S rDNA sequences of M. tuberculosis (GenBank entry 
6B:MtCY130, accession number Z73902), M. avium (GenBank entry GB:MA23SRNA, 
accessksn number X74494). M. paratubercutosis (GenBank entry GB:MPARRNA, accession 
5 nurhber X74495), M. phlei (GenBank entry GB:MP23SRNA, accession number X74493X M. 
leprae (GenBank entry GB:ML5S23S. accesston number X56657), M, gastri (GenBank entry 
GB:MG23SRRNA, accession number Z1 721 1 ). IVI. kansasii (GenBank entry 
GB:MK23SRRNA, accession number Z1 721 2). and M. smegmatis (GB:MS16Si23S5, 
accession number Y08453). Prefen-ed peptide nucleic acid probes should enclose at least one 

10 nuclebbase complementary to a nucleobase of M. tuberculosis 23S rRNA within positions 

149-158, 220-221. 328-361, 453-455. 490-501, 637-660. 706-712, 762-789, 989, 1068-1072, 
1148. 1311-1329, 1361-1364, 1418, 1563-1570. 1627-1638, 1675-1677. 1718. 1734-1740. 
1967-1976, 2403-2420, 2457-2488. 2952-2956, 2966-2969. 3000-3003. and 3097-3106 of the 
alignment (indicated by heavy frames). Differences between the sequences of M. avium. 

15 phlei, 1^. leprae, M. paratuberculosis, M. gastri and M. kansasii and that of M, tubercutosis in 
the alignment are indicated by light frames. 

Alignment for the MTC group (16S rDNA) 

Figures 2A-2D show alignments of 16S rDNA sequences of M. tuberculosis (GenBank entry 

20 GB:MTU16SRN, accession number X52917), M. bovis (GenBank entry GB:MSGTGDA, 
accession number M20940), M. avium (GenBank entry GB:MSGRRDA, accession number 
M61673), M. intracellulare (GenBank entry GB:MIN16SRN. accession number X52927), M. 
paratuberculosis (GenBank entry GB:MSGRRDH. accession number M61680), M. 
scrofulaceum (GenBank entry GB:MSC16SRN, accession number X52924), M. leprae 

25 (GenBank entry GB:MLEPl6S1 . accession number X55587), M. kansasii (GeiiBank entry 
GB:MKRRN16, accession number XI 5916), M. gastri (GenBank entry GB:MGA16SRN, 
accession number X5291 9), M. gordonae (GenBank entry GB:MSGRR16SI. accession 
number M29563) and M. marinum (GenBank entry GB:MMA16SRN, accesston number 
X52920). Pr^fenned peptide nucleto add probes shouki enclose at least one nucleobase 

30 complementary to a nucleobase of M. tuberculosis 16S rRNA within positions 76-79, 98-101 , 
135-136, 194-201, 222-229. 242. 474. 1136-1145, 1271-1272. 1287-1292, 1313. and 1334 Of 
the alignment (indicated by heavy frames). Differences between the sequences of M. bovis, M. 
avium, M. intracellulare, M. paratuberculosis, M. scrofulaceum, M. leprae, M. kansasii, M. 
gastri, M. gordonae and M. marinum, and that of M. tuberculosis in the alignment are indicated 

35 by light frames. 



Alignment for the MTC group (5S rDNA) 

Figure 3 shows alignments of 5S rDNA sequences of M. tuberculosis (GenBank entrf 
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GB:MTDNA16S, accession number x75601). M. bovis (GenBank entry GB:MBRRN5S, 
accession number X05526), M. phlel (GenBank entry GB:MP5SRRNA. accession number 
X55259). M. leprae (GenBank entry GB:ML5S23S, accession number X56657), and M. 
smegmatis (GenBank entry 6B:IVIS16S23S5, accession number Y08453). Preferred peptide 
5 nucleic acid probes should enclose at least one nucleobase complementary to a nucleobase 
of M. tuberculosis 5S rRNA within positions 86-90 of the alignment (indicted by heavy frame). 
Differences between the sequences of M. bovis, M. phlei, M. leprae, smegmatis and M. 
luteus and that of M. tuberculosis in the alignment are indicated by light frames. 

10 Alijgnment for Mycobacteria other than those of the MTC group (23S rDNA) 

Figures 4A-4L show alignments of 23S rDNA sequences of IVI. avium (GenBank entry 
GB:M/^3SRiNA, accession number X74494), IVI. paratubercuiosis (GenBank entry 
GB:MPARRNA, accession number X74495). M. tuberculosis (GenBank entry GB:MTCY130, 
accession number Z73902), M. phlei (GenBank entry GB:MP23SRNA. accession number 

15 X74493). Wi. leprae (GenBank entry GB:ML5S23S. accession number X56657), M. gastri 
(GenBank entry GB:MG23SRRNA. accession number Z17211). M. kansasii (GenBank entry 
GB:MK23SRIRNA, accession number Z1 7212), and smegmatis (GB:IVIS16S23S5, 
accession number ¥08453). Preferred peptide nucleic acid probes should enclose at least one 
nucleobase cdnnplementery to a nucleobase of M. avium 23S rRNA within positions .99-1 01. 

20 183, 261-271, 281-284, 290-293, 327-335. 343-357. 400-405, 453-462, 587-599, 637-660. 
704-712, 763-789, 1066-1074, 1177-1185, 1259-1265. 1311-1327. 1345-1348. 1361-1364. 
1556-1570. 1608-1613, 1626-1638, 1651-1659. 1676-1677, 1734-1741, 1847-1853. 1967- 
1976, 2006-2010, 2025-2027, 2131-2232. 2252-2255, 2396-2405. 2416-2420, 2474-2478, 
2687, 2719. 2809, 3062-3068, and 3097-3106 of the alignment (indicated by heavy frames). 

25 Differences between the sequences of IVI. paratubercuiosis, M. tuberculosis, M. phlei, M. 
leprae, M. gastri, M. kansasii, and M. smegmatis and that of M. avium in the alignment are 
indicated by light frames. 

Alignment for Mycobacteria other than those of the MTC group (16S rDNA) 
30 Figures 5A-5B show alignments of 1 6S rDNA sequences of M. avium (GenBank entry 
GB:MSGRRDA, accession number M61 673). M. intracellulare (GenBank entry 
GB:MiN16SRN, accession number X52927),t;/I. paratubercuiosis (GenBank entry 
GB:MSGRRbH, acceission number M61680), M. scrofulaceum (GenBiank entry GB: 
MSC16SRN. accession number X52924), M. tuberculosis (GenBank entry GB:MTU16SRN, 
35 accession number X52917), bovis (GenBank entry GB:MSGTGDA, accession number 
I\/I20940), M. leprae (GenBank entry GB:MLEP16S1. accession number X55587), M. kansasii 
(GenBank entry GB:MKRRN16, accession number X15916), and M. gastri (GenBank entry 
GBMGAieSRN, accession number X5291 9), M. gordonae (GenBank entry GB:IVISGRR16SI, 
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accession number M29563), and M. niarinum (GenBank entry GB:MIVIA16SRN. accession 
number X52920). Preferred peptide nucleic acid probes should enclose at least one 
nucteobase complementary to a nucleobase of M. avium 16S rRNA within positions 1 36-1 36, 
472-475, 1136-1144, 1287-1292, 1313, and 1334 of the alignment (indicated by heavy 
5 frames). Differences between the sequenced of M. intracellulare, M. paratuberculosis, M. 
scrofulaceumj M. tuberculosis, M. bovis, M. leprae, M. kansasli, and M. gastri and that of M. 
avium in the alignment are indicated by light frames. 

Drug-resistance 

10 Figure 6 shows a partial M. avium 23S rDNA sequence including positions 2550 to 2589 of 
GenBank entry X74494. Bases In positions where deviations from the wild-type sequence 
have been correlated with macrolide-resistance are framed. Positions 2568 and 2569 in the 
figure con^espond to positions 2058 and 2059, respectively, of E. coli 23S rRNA. 

15 Figure 7 shows a partial M. tuberculosis 16S rdNA sequence including positions 441 to 491 
and 843 to 883 of GenBank entry X52917. Bases in positions where deviations from the wiki- 
type sequence havis been correlated with resistance to streptomycin are framed. Positions 
452, 473. 474, 477, 865, and 8B6 in the figure conrespond to positions 501, 522, 523. 526, 
912, and 913, respectively, of E.coli 16S rRNA. 

20 

SPECIFIC DESCRIPTION 

The present invention provides novel probes for use in rapid and specific, sensitive 
hybridisation based assays for detecting a target sequence of one or more mycobacteria, 
25 which target sequence is located in the mycobacterial rDNA, precursor rRNA, or in the 238, 
16IS or 5S rRNA. The probes to be used in accordance with the present invention are peptide 
nucleic acid probes. Peptide nucleic acids are non-naturally occurring polyamides or 
tx)lythioamides which can bind to nucleic acids (DNA and RNA). Such compounds are 
described in e.g. WO 92/20702. 

30 

We have identified suitable variable regions of the target nucleic acid by comparative analysis 
of generally available rDNA sequences and sequences obtained by sequencing as described 
above. Computers and computer programs, which have been used for the purposes disclosed 
herein, are commercially available. From such alignments, possibly suitable probes can be 
35 identified. The alignments are thus a useful guideline for designing probes with desired 
characteristics. 

When designing the.probes, due regard should be taken to the assay conditions under which 
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the probes are to be used. Stringency is chosen so as to maximise the difference in stability 
between the hybrid fornied with the target nucleic acid and that fomied with the non-target 
nucleic acid. It will typically be necessary to choose high stringency conditions for probes 
where the specificity depends on only one mismatch to non-target sequences. The more 
5 mismatches to non-target sequences, the less demand for high stringency conditions. 

Furthermore, probes should be designed so as to minimise the stability of probe-non-target 
nucleic acid hybrids. This may be accomplished by minimising the degree of complementarity 
to non-target nucleic acid, i.e. by designing the probe to span as many destabilising 

10 mismatches as possible, and/or to include as many additions/deletions relative to the target 
sequence as possible. Whether a probe is useful for detecting a particular mycobacterial 
species depends to some degree on the difference between the thermal stability of probe- 
target hybrids and.prbbernon-target hybrids. For rRNA targets, however, the secondary 
structure of the region of the rRNA rnolecule in which the target sequence is located may also 

15 be of importance. The secondary staicture of a probe should also be taken into consideration. 
Probes should be designed so as to minimise their proclivity to form hairpins, self-dimers, and 
pair-dimers if a mixture of two or more probes is used. 

Mismatohing bases In hybrids formed between peptide nucleic acid probes and nucleic acids 
20 result in a higher thermal Instability than mismatching bases in nucleic acid duplexes of the 
same sequences. Thus, the peptide nucleic acid probes exhibit a greater specificity for a given 
target nucleic acid sequence than a traditional nucleic acid probe, which is seen as a greater 
difference in T^, values for probe-target hybrids and probe-non-target hybrids. The sensitivity 
and specificity of a peptide nucleic acid probe will also depend on the hybridisation conditions 
2i5 used. 

The primary concem regarding the length of the peptide nucleic acid probes is the warranted 
speclficify. t.e. which length provides sufficient specificity for a particular application. The 
optimal length of a peptide nucleic acid probe comprising a particular site with differences in 
30 base composition, e.g. among iselected regions of mycobacterial rRNA, is a compromise 
between the general pattem that longer probes ensure specificity and shorter probes ensure 
that the destabilising differences in base composition constitute a greater portion of the probe. 
Also, due regard must be paid to the conditions under which the probes are to be used. 

35 Peptide nucleic acid sequences are written from the N-terminal end of the sequence towards 
the C-terminal end. A free (unsubstituted) N-terminal end or an N-terminai end terminating 
with an amino acid is Indicated as H, and a free C-temnlnal end is indicated as NHj (a 
cariioxamide group). Peptide nucleic acids are capable of hybridising to nucleic acid 
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sequences in two orientations, namely in antlparallel orientation and in parallel orientation. The 
peptide nucleic acid is said to hybridise in the antlparallel orientation when the N-terminal end 
of the peptide nucleic acid is lacing the 3' end of the nucleic acid sequence, and to hybridise in 
the parallel orientation when the C-terminal end of tte peptide nucleic acid is facing the 5' end 
5 of the nucleic acid sequence. In most applications, hybridisation in the antlparallel orientation 
is preferred as the hybridisation in the parallel orientation takes place rather slowly and as the 
formed duplexes are not as stable as the duplexes having antlparallel strands. Triplex 
formation with a stoichiometry of two peptide nucleic acid strands and one nucleic acid strand 
may occur if the peptide nucleic acid has a high pyrlmidine content Such triplexes are very 
10 stable, and probes capable of forming triplexes may thus be suitable for certain applications; 

iy/Ialnly because the peptide nucleic acid strand is uncharged, a peptide nucleic acid-nucleic 
acid-duplex will have a higher T,n than the corresponding nucleic acid-nucleic acid-duplex. 
Typically there will be an increase in 7^ of about 1 ''C per base pair at 100 mM NaCl 
1 5 depending on the sequence (Egholm et at. (1 993). Nature. 365, 566-568). 

In contrast to DNA-DNA-dupiex formation, no salt Is necessary to facilitate and stabilise the 
formation of a peptide riucleic acid-DNA or a peptide nucleic acid-RNA duplex. The T^ of the 
peptide nucleic acid-DNA-duplex changes only little with increasing Ionic strength. Typically for 

20 a 15-rher, the T^ will drop only 5 ""C when the salt concentration is raised from 10 mM NaCI to 
1 M NaCI. At low ionic strength (e.g. 10 mM phosphate buffer with no salt added), 
hybridisation of a peptide nucleic acid to a target sequence is possible under conditions where 
no stable DNA-DNA-duplex formation occurs. Furthermore, target sites that normally are 
inaccessible can be made nriore readily accessible for hybridisation with peptide nucleic add 

is probes at low salt conceritratlbn as the secondary and tertiary structure of nucleic acids are 
destabilised under such cbndltiohs. Using peptide nucleic acid probes, a separate 
destabilising step or use of destabilising probes may not be necessary to perform. 

rRNA is essential for proper function of the ribosomes and thus the synthesis of proteins. The 
30 genes encoding the rRNAs are in eubacteria located In an operon in which the small subunit 
RNA gene, the 16S rRNA gene, is located nearest the 5' end of the operon. the gene for the 
large subunit RNA, the 23S rRNA gene, Is located distal to the 16S rRNA gene and the 5S 
rRNA gene is located neareist the 3' end of the operon. The three genes are separated by 
spacer regions in which tRNA genes may be found, however, there are none in M. 
35 tuberculosis. The primary transcript of the eubacterial rRNA operon is cleaved by RNaselll. 
This cleavage results in separation of the 16S, the 23S and the 5S riRNA into precursor rRNA 
molecules (pre-rRNA molecules) which besides the rRNA species also contain leader and tail 
sequences. The primary RNase 111 cleavage is normally a rapid process, whereas the 
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subsequent maturation is substantially slower. Precursor rRNA is typically more abundant 
than even strongly expressed mRNA species. Thus, for certain applications, precursor rRNA 
may be an attractive diagnostic target. In order to specifically detect precursor rRNA. a target 
probe should be directed against sequences comprising at least part of the leader or tail 
5 sequences. A target probe may further be directed against sequences of which both part of 
the leader/tail and mature rRNA sequences are constituents. 

Usually, patients having contracted a mycobacterial infection are treated with medicaments 
until no mycobacteria can be found in the sputum. Except for culturing, the presently available 
10 methods do not allow for clear distinguishing between living and dead mycobacteria. This 
means that a patient may often be treated with medicaments for a longer period of time than 
actually necessary. A way of determining the progress of treatment would be a very valuable 
tool in the fight of tuberculosis and other mycobacterial diseases. 

15 As transcription and maturetion of rRNA is a measure of viability, detection of precursor rRNA 
is a suitable and direct measure of viability of the bacteria. Furthennore, precursor rRNA may 
be used for identification of antibiotic drugs which reduce or inhibit rRNA transcription. One 
such example is rifampicin. A transcriptional inhibitor will in susceptible bacteria eliminate new 
synthesis of rRNA and thus the pool of precursor rRNA will be depleted, l-lowever. in resistant 

20 cells, primary transcripts as well as precursor rRNAs will continue to be produced. 

Although it is preferred to use peptide nucleic add probes targeting specific sequences of 
rRNA, it will readily be understood that peptide nucleic acid probes complementary to rRNA 
targeting probes will be useful for the detection of the genes coding for said sequence specific 
25 rRNA (rDNA). and peptide nucleic acid probes for the detecting rDNA is hence contemplated 
by the present invention. Although it is prefen^ed to choose the sequence of the probe so as to 
enable the probe to hybridise to its target sequence in antlparallel orientation, it is to be 
understood that probes capable of hybridising in parallel orientation can be constructed from 
the same information. The present invention is intended to cover both types of probes. 

30 

In ttie broadest aspect, the present invention relates to peptide nucleic add prebes for 
detecting a target sequence of one or more mycobacteria optionally present in a test sample, 
said probe being capable of hybridising to a target sequence of mycobacterial rDNA, 
precursor rRNA or rRNA (daim 1). 

35 

The probes of the invention may suitably be directed to rDNA. precursor rRNA. or to 23S, 16S 
or 5S rRNA. 
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In accordance with claim 3, the target sequences, to which the peptide nucleic acid probe(s) 
are capable of hybridising to, are obtainable by 

(a) conniparing the nudeobase sequences of said mycobacterial rRNA or rDNA of one or more 
5 mycobacteria to be detected with the corresponding nucleobase sequence of organism(s), in 

particular other mycobacteria, from which said one or more mycobacteria are to be 
distinguished, 

(b) selecting a target sequence of said rRNA or rDNA which includes at least one nudeobase 
differing from the corresponding nudeobase of the organism(s), in particular other 

10 mycobacteria, from which said one or more mycobacteria are to be distinguished, and 

(c) determining the capability of said probe to hybridise to the selected target sequence to 
form detectable hybrids. 

Peptide nucleic acid probes are, in accordance with daim 4, obtainable by 

15 

(a) comparing the nudeobase sequences of said mycobacterial ri^NA or rDNA of one or more 
mycobacteria to be detected with the oonresponding nudeobase sequence of organism(s), in 
particular other mycobacteria, in particular other mycobaderia, from which said one or more 
mycobacteria are to be distinguished, 
20 (b) setedihg a target sequence of said rRNA or rONA which includes at least one nudeobase 
differing from the corresponding nudeobase of the organtsm(s), in particular other 
mycobacteria, from which said one or more mycobacteria are to be distinguished, 
(c) synthesising said probe, and 

(4) determining the capability of said probe to hybridise to the selected target sequence to 
25 fomi detectable hybrids. 

The probes are in particular suitable for detecting a target sequence of one or more 
mycobacteria of the Mycobacterium tuberculosis Complex (MTC) or for detecting a target 
sequence of one or more mycobacteria other than mycobacteria of the Mycobacterium 
30 tuberculosis Complex (MOTT) optionally present in a sample, which probe comprises from 6 
to 30 polymerised peptide nucleic add nroieties, said probe being capable of hybridising to a 
target sequence of mycobacterial rDNA, precursor rRNA or 23S. 16S or 5S rRNA forming 
detectable hybrids (dalm 5). In accordance with claim 6, such probes may comprise peptide 
nucleic acid moieties of formula (I) 
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(1) 



wherein each X and Y independently designate 0 or S, 

each Z independently designates O, S, NR\ or C(R^)2, wherein each independently 
designate H, C^^ alkyi, alkenyl. C,^ alkynyl, 

10 each R^. R^ and R"^ designate independently H, the side chain of a naturally occurring amino 
acid, the side chain of a non-naturally occurring amino acid, C^^ alkyi, Ci^ alkenyl or 
alkynyi. or a functional group, each Q independently designates a naturally occurring 
nucleobase, a non-naturally occurring nuclebbase, an intercaiator, a nucleobase-binding 
group, a label or H, 

15 - and with the proviso indicated in claim 6. 

The probes may suitably be used for detecting a species specific mycobacterial target 
sequence, or target sequences of a group of mycobacteria like MTC, MOTT. MAC or MAIS. 
The probes may further be designed so as to be capable of hybridising to one or rmte drug 
20 resistant mycobacteria, or. alternatively, to the wild-type corresponding thereto. In the design 
of the probes, sequences between different mycobacteria (one or more) may be taken into 
account as may sequences from other related or non-related organisms (one or more). 

As mentioned above, drug-resistance is an increasing threat to the fight of mycobacterial 
25 infection. Monotherapy with macrolides such as clarithromycin and azithromycin often leads to 
clinically significant drtig-resistance. Clarithromycin arid azithromycin are important dmgs in 
the treatment of infections by espedally M. avium. Comparison between 23S rRNA sequences 
from isolates of M. avium and M. intracellulare with acquired resistance to clarithromycin and 
azithromycin and 23S rRNA sequences from non-resistant strains has revealed that the 
30 maijority of resistant strains have single-point mutations in the 23S rRNA In positions 

conrespdnding to 2058 and 2059 in E. coll 23S rRNA. In the M. avium 23S rRNA sequence 
(GenBank accession number X74494), the coaesponding bases are in position 2568 and 
2569, respectively, as shown in Figure 6. Most susceptible strains have an A residue in one of 
these positions whereas the resistant strains have a C, G or T in position 2058 (E. coli 
35 numbering con-esponding to 2568 in M. avium with GenBank accession number X74494), or 
G or C in position 2059 (E. coli numbering corresponding to 2569 in M. avium with GenBank 
accession number X74494). 
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Single-point mutations in rRNA apparentiy also account to some degree for streptomycin 
resistance. Striaptomydn. the first successful antibiotic drug against tuberculosis, is an 
aminocyciftol glycoside that perturbs protein synthesis at the ribosomal level. The genetic 
biasis for streptomycin resistance has not yet been completely solved. However, some 
5 streptomycin resistant strains of M. tuberculosis have single-point mutations in 16S rRNA. 
These mutations are located in positions corresponding to bases 501, 622, 523, 526. 912 and 
913 In E. coli 16S rRNA which oonrespond to bases with numbers 452, 473, 474, 477, 865 and 
866. respectively, of M. tuberculosis 168 rRNA (GenBank accession number X5291 7) as 
shown in Figure 7, Most of these mutated nucleotides are Involved in structural interactions 
10 within the 530 loop of 16S rRNA which is one of the most conserved regions in the entire 16S 
rRNA gene. 

Mutations In an 81 bp region of the gene (rpoB) encoding the beta subunit of RNA polymerase 
are the cause of 96% of the cases of rlfampicln resistance in M. tuberculosis and M. leprae. 

15 rRNA precursor molecules have terminal domains (tails) which are removed during late steps 
in precursor rRNA processing to yield the mature rRNA molecules. Transcriptional inhibitors 
such as rifampicin can deplete precursor rRNA in sensitive cells by inhibiting de novo 
precursor rRNA synthesis while allowing maturation to proceed. Thus, precursor rRNA is 
depleted in susceptible mycobacterium cells while it remains produced In resistant 

20 mycobacterium cells when the cells are exposed to rifarhplcin during culturing. 

Peptide nucleic acid probes for detecting a target sequence of one or more mycobacteria of 
the Mycobacterium tuberculosis Complex are defined in claims 7 to 10. Peptide nucleic acid 
probes for detecting a target sequence of one or more mycobacteria other than mycobacteria 
25 of the Mycobacterium tuberculosis Complex are defined in claims 1 1 to 1 3. Peptide nucleic 
acid probes for detecting a target sequence of one or more drug resistant nnycobacteria of the 
Mycobacterium tuberculosis complex or of one or more drug resistant mycobacteria other than 
mycobacteria of the Mycobacterium tuberculosis Complex are defined in claim 14. 

30 In the present context and the claims, the temn "naturally occumng nucleobases" includes the 
four main DNA bases (i.e. thymine (T), cytosine (C), adenine (A) and guanine (6)) as well as 
other naturally occurring nucleobases (e.g. uracil (U) and hypoxanthine). 

The term "hon-naturally occurring nucleobases" comprises La. modified naturally occurring 
35 nucleobases. Such non-naturally occumng nucleobases may be modified by substitution by 
e.g. one or more C,^ alkyi, C,^ alkenyl or Ci.8 alkynyl groups or labels. Examples of non- 
naturally occurring nucleobases are purine, 2,6-diamino purine, 5-propynylcytosine (C 
propynyl). isocytosine (iso-C). 5-methyl-isocytosine (iso*^C) (see e.g. Tetrahedron Letters Vol 
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36. No 12. 2033-2036 (1995) or Tetrahedron Letters Vol 36. No 21. 3601-3604 (1995)). 7- 
deazaadenine, 7-deazaguanine. N*-ethanocytosine. N®-ethano-2.6-diaminopurine, 5-(Cm)- 
alkenyluracii. S-(Ca«)-atkynyl(^tosine. 5-fluorouracit and pseudocytosine. 

Examples of useful intercalators are e.g. acridin. antraqulnone. psoralen and pyrene. 

Examples of useful nudeobase-blnding groups are e.g. groups containing cyclic or 
heterocyclic rings. Non-limiting examples are 3-nitro pyrrole and 5-nItro indole. 

It Is to be understood that all<yl. alkenyl and alkynyl groups may be branched or non-branched, 
cyclic or non-cyclic, and may further be interrupted by one or more heterbatoms, or may be 
unsubtituted or substituted by or may contain one or more functional groups. Non-limiting 
examples of such functional groups are acetyl groups, acyl groups, amino groups, carbamldo 
groups, carbariTdyl groups, carbamyl groups, carbonyl groups, carboxy groups, cyano groups, 
dlthio groups, fonmyl groups, guanidino groups, halogens, hydrazine groups, hydrazo groups, 
hydroxamino groups, hydroxy groups, keto groups, mercapto groups, nitra groups, phospho 
groups, phosphono groups, phospho ester groups, suifo groups, thioc^anato groups, cyclic, 
aromatic and heterocyclic groups. 

groups contain from 1 to 4 carbon atoms. Ci.6 groups contain from 1 to 6 carbon atoms, 
and Ci.i5 groups contain from 1 to 15 carbon atoms, not Including optional substituents, 
heteroatoms and/or functional groups. Non-limiting examples of such groups are -CH3, -CF3. 
-CH2-, -CH2CH3, -CH2CH2-, -CH(CH3)2. -OCH3, .OCH2-. -OCH2CH3, -OCH2CH2- -OCH(CH3)2. 
-0C(0)GH3. -0C(0)CH2-. -C(0)H. -G(0)-. -C(0)CH3, -C(0)OH. -C(0)0-. .CH2NH2, -CH2NH., 
-CHjOCHg. .CH2OCH2-. -CH20C(0)0H. -CH20C(0)0-. -CH2C(0)CH3, -CH2C(0)CH2-. 
-C(0)NH2, -CH=CH2, -CH^CH-, -CH=CHCH2C(0)0H, -CH=CHCH2C(0)0-. -CsCH, -CsC-. 
-CHaCsCH. -CHjCsC-, -CHaCsCCHa. ^CHjCfeCH. -OCHzCbO. -OCHjCsCCHa, 
-NHCHaqO)-. .NHCH2CH2C(0)-. -NH(CH2CH20)2CH2C{0).. and H0(0)CCH2C{0)(NH. 
(CH2CH20)2CH2C(0))r. phenyl, benzyl, naphthyl. oxazolyl, pyridinyl, thiadlazolyl, triazolyl, 
and thienyl. 

Within the present context, the expression "naturally occuning amino acid" is intended to 
comprise D- and L-forms of amino acids commonly found in nature, e.g. D- and L-fonns of Ala 
(alanine). Arg (arginine). Asn (aspargine). Asp (aspartic acid). Cys (cysteine), Gin (glutamihe). 
Glu (glutamic acid), His (histidine). lie (isoleucine). Leu (leucine), Lys (lysine), Met 
(rioethionine). Phe (phenylalanine). Pro (proline). Ser (serine), Thr (threonine), Trp 
(tryptophan), Tyr (tyrosine) and Val (valine). 
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In the presient context, the expression "non-naturally occurring amino acid*' is intended to 
comprise D- and L-fonms of amino acids other than those commonly found in nature as well as 
modified naturally occurring amino acids. Examples of useful non-naturally occurring amino 
adds are b- and L-fomis of p-Ala (Alanine) Cha (cyclohexylalanine). Cit (dtrulline), Mci 
5 (homocitrulline), HomoCys (homocystein). Hse (homoserine), NIe (norleucine), Nva 
(norvaline). Orn (ornithine). Sar (sarcosine) and Thi (thienylalanine). 

In the present coritext the temi "sarnple" is intisnded to cover all types of samples suitable for 
the purpose of the invention. Examples of such samples are sputum, laryngeal swabs, gastric 
10 lavage, bronchia! washings, biopsies, aspirates, expectorates, body fluids (spinal, pleural. 

pericardial, synovial, blood, pus, bone marrow), urine, tissue sections as well as food samples, 
soil, air and water samples. Analysis of samples originating from the before-mentioned 
samples (e.g. cultures and treated samples) are also within the scope of the invention. 

15 In the present context, the term "hybrids" is intended to include complexes befcween a probe 
and a nucleic acid to be determined. Such hybrids may be made up of two or rriore strands. 

The strength of the binding between the probe and the target nucleic acid sequence rriay be 
influenced by the ligand Q. When Q designates a nucleobase. Hoogsteen and/or Watson- 

20 Crick base pairing assist(s) in the formation of hybrids between a nucleic acid sequence to be 
detected and a probe. It is contemplated that one or more of the ligands may be a group which 
contribute little or none to the binding of the nucleic acid such as hydrogen. It is contemplated 
that suitable probes to be used comprise less than 25% by weight of peptide nucleic acid 
moieties, wherein Q designates such groups. One or more of the ligands Q may be groups 

25 that stabilise nucleobase stacking such as intercalators or nucleobaise-blnding groups. 

In the above-indicated probes, one oir more of the Q-groups may designate a label. Examples 
of suitable labels are given below. Moieties wherein Q denotes a label may preferably be 
located In one or both of the terminating mdieties of the probe. Moieties wherein Q denotes a 
30 label miay, however, also be located internally. 

The peptide nucleic acid probes may comprise moieties, wherein all X groups are O 
(pblyamides) or wherein all X groups are S (polythioamides). It is to be understood that the 
probes may also comprise mixed moieties (comprising both amide and thioamide moieties). 

35 

In another aspect, the present invention relates to peptide nucleic acid probes of formula (II), 
(111) and (IV) as well as mixtures of such probes defined in claim 15. 
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In a preferred embodiment, the peptide nucleic add probes or mixtures thereof according to 
the invention are of formulas (IHIV) as defined in claim 16 with Z being NH, NCH3 or O, each 
R^ and independently being H or the side chain of a naturally occuning amino acid, the 
side chain of a non-naturally occurring amino acid, or allcyl, and each Q being a naturally 
5 occurring nudeobase or a non-naturally occumng nudeobase with the provisos defined in 
daims 6 to 14. 

Peptide nucleic acid probes or mbctures of such probes according to the invention are 
preferably those of fomriula (i)-(IV) as defined In claim 17 with 2 being NH or O, and being 
10 H or the side chain of Ala, Asp, Cys, Glu, His, HomoCys, Lys, Om, Ser or Thr, and Q being a 
nudeobase selected from thymine, adenine, cytosine. guanine, uracil, iso-C, and 2.6- 
diamihopurine with the provisos defined in claims 6 to 14. 

Peptide nucleic acid probes or mixtures thereof, which are of major interest for detecting 
15 ' mycobacteria of the MTC group or one or more mycobacteria other than mycobacteria of the 
l\1TC group, are probes of formula (V) according to claim 18, wherein R^ is H or the side chain 
of Ala, Asp, Cys, Glu, His. HomoCys, Lys. Om, Ser or Thr, Q Is as defined in claim 17 and 
with the provisos indicated in claims 6 to 14. 

20 The peptide nucleic acid probe comprises polymerised moieties as defined above and in the 
daims. From the formula, it is to be understood that the probe may comprise polymerised 
moieties which structure may be mutually different or identical. In some cases, it may be 
advantageous that at least one moiety of the probe, preferably one (or both) of the moieties 
terminating the probe, are of a different structure. Such terminating moieties may suitably be a 

25 moiety of formula (VI) 




30 

where Q is as defined above. Such moiety may suitably be connected to a peptide nucleic 
acid moiety through an amide bond. 

The peptide nucleic acid probe according to the invention comprises from 6 to 30 polymerised 
35 moieties of formulas (I) to (V), and. in addition, optionally one or two terminating moieties of 
formula (VI) as defined above. The preferred length of the probe will depend on the sample 
material and whether labelled probes are used. It Is contemplated that espedally interesting 
probes comprise from 10 to 30 polymerised moieties of formulas (I) to (V), and, in addition, 
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optionally one or two terminating moieties of fbmiula (VI) as defined above. Probes of the 
invention may suitably comprise from 12 to 25 polymerised moieties of formulas (i) to (V), 
more suitably from 14 to 22 polymerised moieties of fomnulas (I) to (V), most suitably from 1 5 
to 20 polymerised nrK)ieties of fbmnulas (I) to (V). and, in addition, optionally one or two 
5 tenDinating moieties of formula (VI). 

As mentioned above, the polymerised moieties of the probes may be mutually different or 
identical. In some embodiments, the polymerised moieties of fbnrtulas (V) constitute at least 
75% by weight (calculated by excluding labels and linkers), preferably at least 80% by weight 
10 and most preferably at least 90% by weight of the probe. 

The ends on the moieties terminating the probe may be substituted by suitable substituents 
which in the following will be named "linkers". A terminating end may suitably be substituted by 
from 1 to 5 linkers, more suitably from 1 to 3 linkers. Such linkers may suitably be selected 

15 aniong Ci.is alKy^ Ci.i6 alkeriyi and C^.^g alkynyl groups as defined above. The linkers may be 
substituted or unsubstituted, branched or non-branched, or be interrupted by heteroatoms. or 
be substituted or contain functional groups as described above. This may depend on the 
chemical nature of the terminating moiety (i.e. whether the moiety Is terinlnated by a carbon, 
oxygen or nitrogen atom). A terminating end or a linker on a terminating end may further be 

20 substituted by one or more labels, which labels may be incorporated end to end, i.e. so as to 
form a non-branched labelled end, or may be incorporated so as to form a branched labelled 
end ("zipper*). The linkers may be attached directly to a terminating end. may be attached to a 
label or between labels on a terminating end, or be attached to a terminating end before a 
label is attached to a terminating end. It should be understood that two terminating ends may 

25 cany different or identical substituents, linkers and/or labels. It should further be understood 
that the iem "a labiBl" is intended to comprise one or nrore labels as the tenn "linkers" is to 
cornprise one or more linkers. For certain applicatfons, it may be advantageous that one or 
more linkers are incorporated between the peptide nuclete acid moieties. Such applications 
may in particular be those based on triplex fonnatlon. 

30 

Examples of suitable linkers are -NH(CH2CH20)„CH2C(0)-. -NH(CHOH)„C(OK 
-(0)C(CH20CH2)„C(0)- and -NH(CH2)„C(0)-. NH2(CH2CH20)nCH2C(0)-. NH2(CH0H)nC(0).. 
HO(0)C(CH20CH2)nC(0)-. NH2(CH2)„C{0)., -NH(CH2CH20)„CH2C(0)OH. 
-NH(CHOH)„C(0)OH, -(0)C(CH20CH2)„C(0)OH and -NH(CH2)„C(0)0H. wherein n is 0 or an 
35 integer from 1 to 8. preferably from 1 to 3. Examples of very interesting linkers are 
-NHCH2C(0)-, -NHCH2CH2C(0)^ -NH(CH2CH20)2CH2C(0)-, and 
HO(0)CCH2CH2C(0)(NH(CH2CH20)2CH2C(0))2.. 
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In the present context, the term "label" refers to a substituent which is useful for detection or 
visualisation. Suitable labels comprtse fluorophores, biotin, dinltro benzoic acid, digoxigenin, 
radioisotope labels, peptide or enzyme labels, chemiluminiscence labels, fluorescent particles, 
hapten, antigien or antibody labels. 

5 

The expression "peptide label" is Inteitded to mean a label comprising from 1 to 20 naturally 
occuning or noh-naturally occurring amino acids, preferably from 1 to 10 naturally occumng or 
non-naturally occumng amino acids, more preferably from 1 to 8 naturally occurring or non- 
naturally occurring amino acids, most preferably from 1 to 4 naturally occuning or non- 
10 naturally occurring amino acids, linked together end to end in a non-branched or branched 
("zipper") fashion. Such peptide label may be composed of amino acids which are mutually 
identical or different. In a prefenred embodiment, such a non-branched or branched end 
comprises one or more, preferably from 1 to 8 labels, more preferably from 1 to 4, most 
preferably 1 or 2. further labels other than a peptide label. Such further labels may suitably 
15 temninate a non-branched end or a branched end. One or more linkers may suitably be 

attached to the tenrhinating end befere a peptide label and/or a further label is attached. Such 
linker unite may also be attached between a peptide label and a further label. Furthermore, 
such peptide labels may be incorporated between the peptide nucleic acid moieties. 

20 The probe as such may also comprise one or more labels such as from 1 to 8. preferably from 
1 to 4. most preferably 1 or 2, labels and/or one or more linker units, which may be attached 
internally, i.e. to the backbone of the probe. The linker units and labels may mutually be 
attached as described above. 

25 Examples of particular interesting labels are biotin, fluorescein labels, e.g. 5-(and 6)-carboxy- 
fluorescein, 5- or 6-cart50xyfluorescein, 6-(fluorescein)-6-(and 6)-carboxamido hexanoic acid 
and fluorescein Isottiiocyanate, peptide labels consisting of from 1 to 20 naturally occurring 
amino acids or non-naturally occumng amino acMs. enzyme labels such as peroxidases like 
horse radish peroxidase (HRP), alkaline phosphatase, and soya bean peroxidase, dinltro 

30 benzoic acid, rhodamine. tetramethylrtiodamine. cyanine dyes such as Cy2, Cy3 and Cy5, 
coumarin. R-phycoerythrin (RPE), allophycoerytiirin, Texas Red, Princeton Red, and Oregon 
Green as Well as conjugates of R-phycoerythrin and, e.g. Cy5 or Texas Red. 

Examples of prefen-ed labels are biotin, fluorescent labels, peptide labels, enzyme labels and 
35 dinitro benzoic acid. Peptide labels may preferably be composed of from 1 to 10. more 

preferably of from 1 to 8, most preferably of from 1 to 4, naturally occurring or non-naturally 
occuning amino adds. It may be particularly advantageous to incorporate one or more other 
labels as well as a peptide label such as from 1 to 8 or from 1 to 4 other labels, preferably 1 or 
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2 other labels. 

Suitable peptide labels may preferably be composed of cysteine, glycine, lysine or ornithine. 

In a further embodiment, the Q substituent as defined above may be labelled. Suitable labels 
are as defined above. Between Q and such a label, a linker as defined above may be 
incorporated. It is prefen^d that such labelled ligands Q are selected from thymine and uracil 
labelled in the 5-positibn and 7-deazaguanlne and 7-dea2aadenine labelled in the 7-posltion. 

A mixture of peptide nucleic acid probes is also part of the present invention. Such mixture 
may comprise more than one probe capable of hybridising to 238 rRNA, and/or more than one 
probe capable of hybridising to 168 rRNA, and/or or more than one probe capable of 
hybridising to 68 rRNA. A mixture of probes may further comprise probe(s) directed to 
precursor rRNA and/or rDNA. The mixture may also comprise peptide nucleic acids for 
detecting more than one mycobacteria in the. same assay. 

In a preferred embodiment, the nudeobase sequence of the peptide nucleic acid probe Is 
selected so as to be substantially complementary to the nudeobase sequence of the target 
sequence In question. In an especially prefenBd embodiment, the nudeobase sequence of the 
peptide nucleic acid probe is selected so as to be complementary to the nudeobase sequence 
of the target sequence In question. By "complementary" is meant that the nucleobases are 
selected so as to enable perfect match between the nudeobases of the probe and the 
nudeobases of the target, I.e. A to T or G to C. By substantially complementary is meant that 
the peptide nucleic acid probe is capable of hybridising to the target sequence, however, the 
probe does not necessarily have to be perfectly complementary to the target. For example, 
probes cdmprlsing one or more bases not complementary to the target sequence and non- 
target sequences, especially base(s) located at the end of the probe, vi^ere the effect on the 
stability of probe-target nucleic acid hybrids is low. Another example is probes comprising 
other naturally occurring bases. Thus provided that the probe is capable of hybridising to the 
target sequence, the nudeobase difference(s) between target sequences and non-target 
sequences ensures that the stability of probe-non-target nudeic add hybrids are lower than 
the stability of probe-target nucleic acid hybrids and therefore make such substantially 
complementary probes applicable for detection of mycobacteria. 

The probes may be synthesised according to the procedures described in "PNA Information 
Package" obtained from Millipore Corporation (Bedford, MA, USA), or may be synthesised on 
an Expedite Mudeic Acid Syntiiesis System (PerSeptive BioSystems. USA). 
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If using the Fmoc strategy for elongation of the probe with linkers or amino acids, it is possible 
to retiain side chain amino groups protected with add sensitive protection groups such as the 
Boc or Mtt group. This methcxl allows introduction of a linker containing several Boc protected 
amino groups which can all be cleaved and labelled in the same synthesis cycle. 

5 

One way of labelling a probe is to use a fluorescent label, such as 5^(and 6)-carboxyfiuore- 
soein, 5- or 6-carbdxyfluorescein, or6-(fluorescein)-5-(and 6)-carboxamido hexanoic acid. The 
acid group is acBvated with HATU (0-(7-azabenzotria2o|.1-yl)-1.1,3,3-tetramethyluronium 
hexafluorophosphate) or HBTU (i2"{1H-ben20triazol-1-yl)-1,l,3,3-tEtramethylurdnium 
10 hexafluorophosphate) and reacted with the N-terminal amino group of the peptide nucleic acid. 
The same technique can be applied to other labelling groups containing an acid function. 
Alternatively, the succinimldyl ester of the above-mentioned labels may suitably be used or 
fluorescein isothiocyanate may be used directly. 

15 After synthesis, probes can be cleaved from the resin using standard procedures as described 
by Millipore Corporation or PerSeptive BioSystems. The probes are subsequently purified and 
analysed using reversed-phase HPLC techniques at 50**C and were characterised by matrix- 
assisted laser desorption/ionisation time of flight mass spectrometry (MALDI-TOFMS), plasma 
desorption mass spectrometry (PDMS), electron spray mass spectrometry (ESMS). or fast 

20 atom bombardment (FAB-I^S). 

Generally, probes such as probes comprising polymerised moieties of formula (IV) and (V) 
may also be prepared as described in, e.g., Angewandte Chemie, international Edition in 
English 35, 1939-1942 (1996) and Bioorganic & Medical Chemistry Letters, Vol 4, No 8, 1077- 
25 1080 (1994). Chemical properties of some probes are described in. e.g., Nature, 365, 566-568 
(1993). 

The method as claimed can be used for the detection of a target sequence of one or more 
mycobacteria optionally present in a sample. The method and the probes provide a valuable 
30 tool for analysing samples for the presence of such target sequences, hence providing 
infonnation for establishing a diagnosis. 

In the assay method according to the invention, the sample to be analysed for the presence of 
mycobacteria is brought into contact with one or more probes or a mixture of such probes 
35 according to the invention under conditions by which hybridisation between the probe(s) and 
any sample rRNA or rDNA originating from mycobacteria can occur, and the formed hybrids, if 
any, are observed or measured, and the observation or measurement is related to the 
presence of a target sequence of one or more mycobacteria. The observation or 



wo 98/15648 



23 



PCT/Dra7/00425 



measurement niay be accomplished visually or by means of instrumentation. 

Prior to contact with probe(s) accctrding to the Invention, the samples may undergo various 
types of sample processing which include purlficdtion, decontamination and/or concentration. 
5 The sample may suitably be decontaminated by treatment with sodium hypochlorite and 
subsequently centrifuged fbrconcentration of the mycobacteria. Samples e.g. sputum 
samples may be treated with a mucolytic agent such as N-AcetyH-cystein which reduces the 
viscosity of the sample as well as be treated with sodium hydroxide which decontaminates the 
sample, and subsequently centrifuged. Other well-known decontamination and concentration 

10 procedures include the Zephiran-trisodium phosphate method, Petroffs sodium hydroxide 
method, the oxalic acid method as well as the cetylpyridlnium chloride-sodium chloride 
method. Samples may also be purified and concentrated by applying sample preparation 
methods such as filtration, immunocapture, two-phase separation either alone or in 
combination. The sample preparation methods may also be used together with the 

15 centrifugation and decontamination methods mentioned above. 

Samples mlay, either directiy or after having undergone one or more processing steps, be 
analysed in primarily two major types of assays, in situ-based assays and in vitiro-based 
assays. In this context, in situ-based assays are to be understood as assays, in which tine 

20 target nucleic acids remain within the bacterial cell during the hybridisation process. Examples 
are in situ hybridisation (ISH) assays on smears and biopsies as well as hybridisation to whole 
cells which may be in suspension and which subsequently may be analysed by e.g. flow 
cytometry optionally after capture of tfie bacteria onto particles (witti same or different type 
and stze), or by image analysis after spreading of tiie bacteria onto a solid medium, filter 

25 membrane or another substantially planar surface. 

In vitro-based assays are to be understood as assays, in which tiie target nucleic adds are 
released from the bacterial cell before hybridisation. Examples of such assays are microtiter 
plate-based assays. Many other assay types, in which the released target nucleic acids by 
30 some means are captured onto a solid phase and subsequentiy analysed via a detector 

probe, are feasible and wlttiin tiie scope of the present invention. Even further, In vitro-based 
assays include assays, in which the target nucleic acids are not captured onto a solid phase, 
but in which the hybridisation and signal generation take place entirely in solution. 

35 Samples for in situ-based assays may suitably be applied and optionally be immobilised to a 
support Techniques for applying of a sample onto a solid support depend on the type of 
sample in question and Include smearing and cytocentirifugation for liquid or liquified samples 
and sectioning of tissues for biopsy materials. The solid support may take a wide variety of 
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forms well-known in the art, such as a microscope slide, a filter membrane, a polymer 
membrane or a plate of various materials. 

In the case of in vitro-based assays, the target nucleic acid may be released from the 
5 mycobacterial cells in various ways. Most methods for releasing the nucleic acids cause 
bursting of the cell wall (lysis) fbltowed by extraction of the nucleic acids into a buffered 
solution. As mycobacteria have complex cell walls containing covalently associated 
peptidoglycans, arabinogalactans and in particular mycolic acids, they cannot easily be 
disrupted by standard methods used for the rapid lysis of other bacteria. Possible methods 
10 which are Icnown to give successful lysis of the mycobacterial cell wall include methods which 
involve treatment with organic solvents, treatment with strong chaotroplc reagents such as 
high concentrations of guanldine thiocyanate, enzyme treatment, bead beating, heat 
treatment, sonicatton and/or application of a French press. 

15 Samples to be analysed by In situ assays may be fixed prior to hybridisation. The person 

skilled in the art will readily recognise that the appropriate pracedure will depend on the type of 
sample to be examined. Fixation and/or immobilisation should preferably preserve the 
morphological integrity of the cellular matrix and of the nucleic acids. Examples of methods for 
fixation are flame fixation, heat fixation, chemical fixation and freezing. Flame fixation may be 

20 accomplished by passing the slide through the blue cone of a Bunsen burner 3 or 4 times; 
heat fixation may be accomplished by heating the sample to 80*»C for 2 hours; chemical 
fixation may be accomplished by immersion of the sample in a fixative (e.g. formamide, 
methanol or ethanol). Freezing Is particuiarty relevant for biopsies and tissue sections and is 
usually carried out in liquid nitrogen. 

26 

In one in situ hybridisation assay embodiment, the sample to be analysed is smeared onto a 
substantially planar solid support which may be a microscope slide, a filter membrane, a 
polymer membrane or another type of solid support with a planar surface. The prefenred solid 
support is a microscope slide. After the smear has been prepared, it may optionally undergo 
30 further pre-treatment like treatment with bactericWal agents or additional fixation by immersion 
in e.g. ethanol. The sariiple may also be pre-treated with enzyme(s) which as primary function 
permeabilise the cells and/or reduce the viscosity of the sample. It may further be 
advantageous to perform a pre-hybridisation step in order to block sites which might othenwise 
give raise to non-specific binding. For this purpose, blocking agents like skim milk, and non- 
35 target probes may suitably be used. The components of the pre-hybrldisation mixture should 
be selected so as to obtain an effective saturation of sites in the sample that might othenvise 
bind the probe non-specifically. The pre-hybridisation buffer may suitably comprise an 
appropriate buffer, blocking agent(s). and detergents. 
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During the in situ hybridisation, one or more probes according to the present invention are 
brought into contact with any target rRNA or rDNA Inside the cells in a hybridisation solution 
under suitable stringency conditions. The concentration of the applied probe may vary 
5 depending on the chemical nature of the probe and the conditions under which hybridisation is 
cam'ed out. Typically, a probe concentration between 1 nM and 1 pM is suitable. The 
hybridisation solution may comprise a denaturing agent which allows hybridisation to take 
place at a lower temperature than would be the case without the agent. The denaturing agent 
should be present in an amount effective to increase the ratio between specific binding and 

10 non-specific binding. The effective amount of denaturing agent depends on the type used and 
on the probe or combination of probes. Examples of denaturing agents are fbrmamrde. 
ethylene glycol and glycerol, arid these may preferably be used In a concentration above 10% 
and less than 70%. The preferred denaturing agent is fbrmamide which is used more 
preferably In concentrations from 20% to 60%. most preferably from 30% to 50%. It should be 

15 noted that In several instances it may not be necessary or advantageous to include a 
denaturing agent 

Prior to hybridisation or during hybridisation, a mixture of random probes (probes with random, 
non-selected sequences of optionally different length) may be added in excess to reduce non- 
20 specific binding. Also, one or more non-sense probes (probes with a defined nucleobase 

sequence and length differing from the nucleobase sequence of the target sequence) may be 
added in excess in order to reduce non-specific binding. Also, non-specific binding of 
detectable probes to one or more non-tanget nucleic acid sequences can be suppressed by 
addition of one or more unlabelled or independently detectable probes, which probes have a 
25 sequence that Is complementary to the non-target sequence(s). It is particulariy advantageous 
to add such bloclcing probes when the non-target sequence differs frx>m the target sequence 
by drily one mismatch. 

It may be advantageous to include iiiert polymers which are believed to increase the effective 
30 concentration of the probe(s) In the hybridisation solution. One such macrx)molecuie is dextran 
sulphate which may be used in concentrations of from 2.5% to 15%. The preferred 
concentration range is from 8% to 12% in the case of dextran sulphate. Other useful 
macromoiecules are polyvinylpyrrolidone and ficoll, which typically are used at lower 
concentrations, e.g. 0.2%. It may further be advantageous to add one or more detergents 
35 which may reduce the degree of non-specific binding of the peptide nucleic acid probes. 
Examples of useful detergents are sodium dodecyl sulphate. Tween 20® or Triton X-100* 
Detergents are usually used in concentrations between 0.05% and 1.0%, preferably between 
0,05% and 0.25%. The hybridisation solution may furthermore contain salt. Although it is not 
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necessary to include salt In order to obtain proper hybridisation, It may be advantageous to 
include salt in concentrations from 2 to 500 mM, or suitably from 5 to 100 mM. 

During hybridisation, other important parameters are hybridisation temperature, concentration 
6 of the probe and hybridisation time. The person sidlled In the art will readily recognise that 
optihial conditions must be detiarmihed for each of the above-mentloried parameters according 
to the specific situation, e.g. choice of probe(s) and type and concentration of the components 
of tiie hybridisation buffer, in particular the concentration of denaturing agent. Presence of 
volume excluders may also have an effect 

to 

Following hybridisation, the sample is washed to remove any unbound and any non- 
specifically bound probe, and consequently, appropriate stringency conditions should be used. 
By stringency is meant tf^e degree to which the reaction conditions favour the dissociation of 
the formed hybrids. The stringency may be increased typically by Increasing the washing 

16 temperature and/or washing time. Typically, washing times from 5 to 40 minutes may be 

sufficient. Two or more washing periods of shorter time may also give good results. A range of 
buffers nnay be used, including biological buffers, phosphate buffers and standard citrate 
buffers. The demand for low salt concentration in flie buffers is not as pertinent as for DNA 
probe assays due to the difference response to salt concentration. In some cases.'it is 

20 advantageous to increase the pH of the washing buffer as It may give an Increased signal-to 
noise ratio (see WO 97/18325). This is conceivably due to a significant reduction of ttie non- 
specific binding. Thus, it may be advantageous to use a washing solution wltti a pH value form 
8 to 10.5. preferably frohfi 9 to 10. 

25 Visualisation of bound probe(s) must be carried out with due regard to the type of label 

choisen. There are a wide range of useful probe labels, in particular various fluorescent labels 
such as fluorescein, fhodamine and derivatives thereof. Furttiermore, labels like enzymes 
(e.g. peroxidases and phosphatases) and haptens (e.g. biotin, digoxigenin, dinitro benzoic 
acid) may suitably be applied. In tiie case of fluorescent labels, ttie hybrids formed may be 

30 visualised using a microscope witii a magnification of at least x 250, preferably x 1000. The 
visualisation may forther be carried out using a CCD (charge coupled device) camera 
optionally controlled by a computer. When haptens are used as labels, ttie hybrids may be 
detected using an antibody conjugated witti an enzyme. In ttiese cases, the detection step 
may be based on colorimetiy, fluorescence or luminescence. 

35 

The probes may alternatively be labelled with fluorescent particles having the fluorescent label 
embedded in ttie partidiss (e.g. Estapor K coloured microspheres), located on tfie surface of 
tile particles and/or coupled to ttie surfeces of ttie particles. As ttie particles have to come into 
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contact with the target nucleic acids within the cells, ttie use of fluorescent particles may 
necessitate pretreatment of the bacteria. Relatively small particles e.g. about 20 nm may 
suitable be used. 

5 In another in situ hybridisation embodiment, frozen tissue or biopsy samples are cut into thin 
sections and transferred to a substantially planar surface, preferably microscope slides. Prior 
to hybridisation, the tissue or biopsy may be treated with a fixative, preferably a precipitating 
fixative such as acetone, or the sample is incubated in a solution of buffered fonnaldehyde. 
Alternatively, the biopsy or tissue section can be transferred to a fixative such as buffered 

10 formaldehyde fori 2 to 24 hours and following fixation, the tissue may be embedded in paraffin 
forming a block from which thin sections can be cut Prior to hybridisation, the tissue section is 
dewaxied and rehydrated using standard procedures. Pemrieabilisation (e.g. treatment with 
proteases, diluted acids, detergents, alcohol and/or heat) may in some cases be 
advantageous. The selected method for permeabilisation depends on several factors, for 

15 instance on the fixative used, the extent of fixation, the type and size of sample, and on the 
applied probe. For these types of samples, sample processing, prehybridisation. hybridisation, 
washing and visualisation may be carried out using same or adjusted conditions as described 
above. 

20 In a further embodiment of the in situ assays, the bacterial cells are kept in suspension during 
fixation, prehybridisation, hybridisation and washing are camed out under the same or similar 
conditions as described above. The prefen-ed type of label for this embodiment is fluorescent 
labels. This allows detection of hybridised cells by flow cytometry, recording the intensity of 
fluorescence per cell. Bacterial cells in suspension may further be coupled to particles, 

25 preferably with a size of from 20 nm to 10 pm. The particles may be made of materials well- 
known in the art like latex, dextran, cellutose and/or agarose, and may optionally be 
paramagnetic or contain a fluorescent label. Nomially, bacterial cells are coupled to particles 
using antibodies against the target bacteria, but oUier means like molecular imprinting may 
also be used. Coupling of the bacterial ceils to particles may be advantageous in sample 

30 handling and/br during detection. 

In the embodiments of in situ hybridisation described above, tiie probes according to the 
invention are used for detecting a target sequence of one or more mycobacteria. In a 
preferred embodiment, the probes are suitable for detecting a target sequence of 
35 mycobacteria of the Mycobacterium tuberculosis Complex (MTC), mycobacteria other than the 
Mycobacterium tuberculosis Complex (MOTT). or mycobacteria of the Mycobacterium avium 
Complex (MAC). The probes are further suitable for detecting simultaneously different target 
sequences originating from the same mycobacteria. 
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Samples to be analysed using In vitro-based assays need to undergo a treatment by which the 
nucleic acids are released from the bacterial cells. Nucleic acids nfiay be released using 
organic solvents, strong chaotropic reagents such as high concentraitions of guanidine 
thiocyanate, enzymes, bead beating, heating, sonication and/or application of a French press. 
The obtained nucleic acids may undergo additional purification prior to hybridisation. 

In one in vitro hybridisation embodiment, the sample cornprising the target nucleic acid is 
added to a container comprising immobilised capture probe(s) and one or more probe(s) 
labelled to function as detector probe(s). The hybridisation should be performed under suitable 
stringency conditions. The hybridisation solution may further comprise a denaturing agent, 
blocking probes, inert polymers, detergents and salt as described for the in situ-type assays. 
Likewise, the hybridisation temperature, probe concentration and hybridisation time are 
important parameters that need to be controlled according to the specific conditions of the 
asisay, e,g. choice of peptide nucleic acid probe(s) and concentration of some of the 
ingredients of the hybridisation buffer. If hybridisation of the target nucleic acid to the capture 
prdbe(s) and detector probe(s), respectively, is performed In two separate steps, different 
parameters, in particular different stringency conditions, may be used In these steps. The 
concentration of the capture probe may be higher for in situ assays as hybridisation may be 
controlled better and washing can be performed more efficientiy. 

Jhe capture probes may be irnmobilised onto a solid support by any means, e.g. by a coupling 
reaction between a carboxyiic acid on a linker and an amino derivatised support. The capture 
probe may further be coupled onto the solid support by photochemical activation of 
photoreactlve groups which hkve been attached absorptively to the solid support prior to 
photochemical activation. Such photoreactive groups are described in the US 5 316 784 A. 
The capture probes may further be coupled to a hapten which allows an affinity based 
immobilisation to the solid support One such example is coupling of a biotin to the probe{s) 
and immobilisation via binding to a steptavidin-coated surface. 

The solid support may take a wide variety of forms well-4<nown in tiie art, such as a microtiter 
plate having one or more wells, a filter membrane, a polymer membrane, a tube, a dip stick, a 
strip and particles. Filter membranes may be made of cellulose, celluloseacetate, 
polyvinylidene fluoride or any otiier materials well-known in the art The polymer membranes 
may be of polystyrene, nylon, polypropylene or any other materials well known in the art. 
Particles may be paramagnetic beads, beads made of polystyrene, polypropylene, 
polyethylene, dextran, nylon, amyloses. celluloses, polyacrylamides and agarose. When the 
solid support has the fonm of a filter, a membrane, a strip or beads, it (they) may be 
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incorporated Into a singie-use device. 



The selection of the label of the detector probe(s) depend on the specific assay format and 
possible instrunientation. When biotin labelled probes are used, the hybrids may be detected 
5 using streptavidih or an antibody against ttie biotin label which antibody or streptavidin may be 
conjugated with ah enzyme and the actual detection depend on the choice of the specific 
enzyme, preferably a phosphatase or a peroxidase, and the substrate for the selected 
enzyme. The signal may in some cases be enhanced using commercially available 
amplification systems such as the catalysed signal amplification system for biotinylates probes 

10 (CSA by DAKO). Various polymer-based enhancement systems may also be used. An 
example is a dextran polymer to which both a hapten specific antibody and an enzyme is 
coupled. The detector probe(s) may further be labelled with other haptens, e.g. digoxigenin, 
dinitro benzoic acid and fluorescein, in which case the hybrids may be detected using an 
antibody against the hapten which antibody may be conjugated with an enzyme. It is even 

15 possible to apply detector probe(s) which have enzymes coupled directly onto the prt)bes. 
There are a wide range of possibilities for selection of enzyme substrates allowing for 
colourimetric (substrates e.g. p-nitro-phenyl phosphate or tetra-methyl-benzidine), fluorogenic 
(substrates e.g. 4-rhethyIumbilliferylphosphate) or chemlluminescent (substrates e.g. 1 ,2- 
didxetanes) detection. 

20 

The detector probes may further be labelled witii various fluorescent labels, preferably 
fluorescein or rhodafnine, in which case the hybrids may be detected by measuring the 
fluorescence. 

25 The detector probe(s) will typically be different from the capture probe(s). thus ensuring dual 
species specificity. The dual specificity will most often allow at least one of the probes to be 
shorter. e.g. a 10 mer probe. 

Furthermore, the capture of purine rich sequences may be improved by utilising bis-peptide 
30 nucleic acids as capture probes. Such bis-peptide nucleic acids are described in WO 

96/02558. The bis-peptide nucleic acids coniprise a first peptide nucleic acid strand capable of 
hybridising in parallel fashion to the terget nucleic acid, and a second peptide nucleic acid 
strand capable of hybridising in antiparallel fashion to the purine rich sequence of the nucleic 
acid to be captured. The two peptide nucleic acid strands are connected by a linker and are in 
36 this way capable of forming a triplex structure with said purine rich sequence nucleic acid. The 
number of polymerised moieties of each linker-separated peptide nucleic acid may be as 
previously defined for non-bis-peptide nucleic acids. However, due to the high stebllity of the 
triplexes formed, bis-peptide nucleic acids with short first and second strands can be used 
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making the design of a pyrimidine rich probe easier. 

Instead of using a detector probe, capture probe: nucleic acid complexes may be detected 
using a detection system based on an antibody reacting specifically with complexes fbmied 
5 between peptide nucleic acids and nucleic acids (such as described in WO 95/17430). in 
which detection system the primary antibody may comprise a label, or which detection system 
comprises a labelled secondary antibody, which specifically binds to the primary antibody. The 
specific detection again depends on the selected substrate which may be of any type of those 
mentioned above. 

10 

Depending on the type of specific assay fonnat, label and detection principle various types of 
Instmmentation may be used including conventional microplate readers, luminometers and 
flow cytometers. Adaptation of adequate instrumentation may allow for automatisation of the 
assay. 

15 

In an example of this embodiment a capture probe of the present invention is coupled to a 
microtiter plate by a photochemical reaction between antraquinon-labelled capture probe and 
polystyrene of the micrbwell. Target rRNA is added to the microwells and incubated under 
stringent conditions. Unbound rRNA is removed by washing and the microwell are incubated 
20 with a hapten-labelled detector probe under stringent conditions. The visualisation is carried 
out using an enzyme-labelled antibody against the hapten, which after removal of unbound 
antibody is detected usiiig a chemiluminescence substrate. 

In another example of this embodiment capture probes are coupled to latex particles, and 
25 hybridisation is earned out under suitable conditions in the presence of e.g. fluorescein 

labelled detector probe(s). After hybridisation and optionally washing, the hybrids are detected 
by flow cytometry. A range of different beads (e.g. by size or colours) may carry different 
capture probes for different targets, thus allowing a multiple detection system. 

30 In a further embodiment of the in vitro assays format, the capture probe, the target nucleic 
add and the detector probe may hybridise in solution, and subsequently the capture probe is 
attached to a siolid phase. The solid phase, the hybridisation conditions and means of 
detection may be selected according to the specific method as described above. 

35 In a further embodiment of in vitro assays, the target nucleic acid may be Immobilised onto 
filter or polymer membranes or other types of solid phases well-known in the art The 
hybridisation conditions and means of detection may be selected according to the specific set- 
up as described above. 
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In a further embodiment df the in vitro assay, an array of up to 100 or even more different 
probes directed against different target sequences may be immobilised onto a solid surface 
and hybridisation of the tariget sequences to all the probes is carried out simultaneously. The 
solid phase; the hybridisation conditions and means of detection may be as described above. 
This allow for simultianeous detection or identification of a range of parameters, i.e. species 
identification and resistance patterns. 

The present probes further provide a method of diagnosing infection by mycobacteria and a 
method for determining the stage of the Infection and the appropriate treatment by which 
methods one or more optionally labelled probes according to the invention are brought Into 
contact with a patient sample and the type of treatment and/or the effect of a treatment is (are) 
evaluated. 



Kits comprising at least one peptide nucleic acid probe as defined herein are also part of the 
present Invention. Such Icit may further comprise a detection system with at least one 
detecting reagent and/or a solid phase capture system. 

DESCRIPTION OF SPECIFIC EMBODIMENTS 

Examples of suitable Os of adjacent moieties are given below. Peptide nucleic acid probes 
comprising such Qs will be suitable for detecting mycobacteria, in particular mycobacteria of 
the MTC group or mycobacteria other than mycobacteria of the MTC group. The probes are 
written from left to right corresponding to from the N-terminal end towards the C-terminal end. 
Suitable Q subsequences for detecting 23S and 16S rRNA as well as 5S rRNA of the MTC 
group are given below. Suitable Q subsequences for detecting 23S and 16S rRNA of 
mycobacteria other than mycobacteria of the MTC group are further given below. The Q 
subsequences include at least one nucleobase complementary to a nucleobase selected from 
the positions given in parenthesis. The Q subsequences are given as non-limiting examples of 
construction of suitable probe nucleobase sequences. It is to be understood that the probes 
may comprise fewer or more peptide nucleic acid moieties than indicated. 



MTC group (23S) 

AGA TGC GGG TAG GAG (selected from positions 149^158 in Figure 1 A), (Seq ID no 1) 

TGT TTT CTC CTC OTA (selected from positions 220-221 in Figure 1 A), (Seq ID no 2) 
ACT GOG TCT GAG CCG (selected from positions 328-361 in 

Figure 1 A and Figure 1 B). (Seq ID no 3) 

TGA TAG TAG GCA GGT (selected from positions 453-455 In Figure 1 B). (Seq ID no 4) 

GGG ATT GAG AGG GGA (selected from positions 490-501 In Figure IB). (Seq ID no 5) 
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TCA CCA CCC TCC TCC (selected from positions 637-660 in Figure 1C). 
CCA CCC TCC TCC (selected from positions 637-660 in Rgure 1C) 
TTA ACC TTG CGA CAT (selected from postttons 706-712 in Figure 1 D), 
ACT ATT CAC ACG CGC (selected from positions 762-789 in Figure ID), 
CTC CGC GGT GAA CCA (selected from position 989 in Figure ID). 
GCT TTA CAC CAC GGC (selected from positions 1068-1072 in Figure IE), 
ACG CTT GGG GGC CTT (selected from position 1148 in Figure IE). 
CCA CAC CCA CCA CAA (selected from positions 1311-1329 In Figure IE), 
CCG GTG GCTTCG CTG (selected from positions 1361-1364 In Figure IF), 
ACT TGC CTT GTC GCT (selected from position 1418 in Figure 1F). 
GATTCG TCA CGG GCG (selected from positions 1563-1670 in Figure IF). 
AAC TCC ACA CCC CCG (selected from positions 1627-1638 in Figure 1G), 
ACT CCA CAC CCC CGA (selected from positions 1627-1638 in Figure 1G). 
ACC CCT TCG CTT GAC (selected from positions 1675-1677 In Figure 1G). 
CTT GCC CCA GTG TTA (selected from position 1718 in Figure 1G), 
CTC TCC CTA CCG GCT (selected from positions 1734-1740 In Figure 1 H), 
GAT ATT CCG GTC CCC (selected from positions 1967-1976 in Figure 1H), 
ACT CCG CCC CAA CTG (selected from positions 2403-2420 In Figure 1H). 
CTG TCC CTA AAC CCG (sfetected from positions 2457-2488 in Figure 11), 
TTC GAG GTT AGA TGC (selected from positions 2457-2488 in Figure 1 1), 
GTC CCT AAA CCC GAT (selected from positions 2457-2488 in Figure II), 
GGT GCA CCA GAG GTT (selected from positions 2952-2956 in Figure 11). 
CTG GGG GGA CAA CTG (selected from positions 29i56-2969 in Figure 1J). 
TTA TCC TGA CCG AAC (selected from positions 3000-3003 in Figure 1J). 
GAC CTA TTG AAC CCG (selected from positions 3097-3106 in Figure 1J). 
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(SeqIDnoO) 
(modified Seq ID no 6) 
(Seq ID no 7) 
(Seq ID no 8) 
(Seq ID no 9) 
(Seq ID no 10) 
(Seq ID no 11) 
(Seq ID no 12) 
(Seq ID no 13) 
(Seq ID no 14) 
(Seq ID no 15) 
(Seq ID no 16) 
(Seq ID no 17) 
(Seq ID no 18) 
(Seq ID no 19) 
(Seq ID no 20) 
(Seq ID no 21) 
(Seq ID no 22) 
(Seq ID no 23) 
(Seq ID no 24) 
(Seq ID no 25) 
(Seq ID no 26) 
(Seq ID no 27) 
(Seq ID no 28) 
(Seq ID no 29) 



30 



35 



40 



MTC group (16S) : 

GAA GAG ACC TTT CCG (selected from positions 76-79 in Figure 2A). (Seq ID no 30) 

CAC TCG AGT ATC TCC (selected from positions 98-101 in Figure 2A), (Seq ID no 31) 

ATC ACC CAC GTG TTA (selected from positions 136-136 in Figure 2A). (Seq ID no 32) 

GCA TCC CGT GGT CCT (selected from positions 1 94-201 In Figure 2B), (Seq ID no 33) 

C:AC AAG ACA TGC ATC (selected from positions 194-201 in Figure 2B). (Seq ID no 34) 

TAA AGC GCT TtC CAC (selected from positions 222-229 in Figure 2B), (Seq ID no 35) 

GCT CAT CCC ACA CCG (selected from position 242 in Figure 28). (Seq ID no 36) 

CCG AGA GAA CCC GGA (selected from position 474 in Figure 2C), (Seq ID no 37) 

AGT CCC CAC CAT TAC (selected from positions 1 136-1145 in Figure 2C), (Seq ID no 38) 

AAC CTC GCG GCA TCG (selected from positions 1271-1272 in Figure 2C), (Seq ID no 39) 

GGC TTT TAA GGA TTC (selected from positions 1287-1292 in Figure 2D), (Seq ID no 40) 

GAC CCC GAT CCG AAC (selected from position 1313 in Figure 2D), (Seq ID no 41) 

CCG ACT TCA CGG GGT (selected from position 1334 in Figure 2D). (Seq ID no 42) 



MTC group (5S) 
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CGG AGG GGC AGT ATC (selected from positions 86-90 in Figure 3). (Seq ID no 43) 
Mycobacteria other than those of the MTC group (23S) 

GAT CAA TGC TCG GTT (selected from positions 99-101 In Figure 4A), (Seq ID no 44) 

5 TIC CCC iGCG TTA CCT (selected from position 1 83 In Figure 4A), (Seq ID no 45) 

TTA GCC TGT TCC GGT (selected from positions 261-271 In Figure 4A). (Seq ID no 46) 

GCA TGC GGT TTA GCC (selected from positions 281-284 In Figure 4B), (Seq ID no 47) 

TAG COG GTT GTC CAT (selected from positions 290-293 in Figure 4B), (Seq ID no 48) 
GTA GAG CTG AGA CAT (selected from positions 327-335 and 

10 343-357 in Figure 4B). (Seq ID no 49) 
GCC GTC CCA GGC CAC (selected from positions 400405 in 

Figure 48 and Figure 4C), (Seq ID no 50) 

CTC GGG TGT TGA TAT (selected from positions 453-462 in Figure 4C), (Seq ID no 51) 

ACT ATT TCA CTC CCT (selected from positions 587-599 in Figure 4C), (Seq ID no 52) 

15 ACG CCA TCA CCiC CAC (selected from positions 637-660 In Figure 4D), (Seq ID no 63) 

CGA CGT GTC CCT GAC (selected from positions 704-712 in Figure 4D), (Seq ID no 54) 

ACT ACA CCC CAA AGG (selected from positions 763-789 in Figure 4E), (iSeq ID no 65) 

CAC GCT TTT ACA CGA (selected from positions 1060-1074 in Figure 4E). (Seq ID no 56) 

GCG ACT AGA CAT CCT (selected from positions 1 177-1185 in Figure 4E), (Seq ID no 57) 

' 20 CGG CGC ATA ATC ACT (selected from positions 1259-1265 in Figure 4E), (Seq ID no 58) 

CCA CAT CCA CCG TAA (selected from positions 131 1-1327 In Figure 4F), (Seq ID no 59) 

CGC TGA ATG GGG GAC (selected from positions 1345-1348 in Figure 4F). (Seq ID no 60) 

GGA GCT TCG CTG AAT (selected from positions 1 361 -1364 in Figure 4G), (Seq ID no 61) 

CGG TCA CCC GGA GCT (selected from positions 1 361 -1 364 in Figure 4G), (Seq ID no 62) 

25 GGA CGC CCA TAC ACG (selected from positions 1 556-1 570 in Figure 4G). (Seq ID no 63) 

GAA GGG GAA TGG TCG (selected from positions 1608-1613 In Figure 4H). (Seq ID no 64) 

AAT CGC CAC GCC CCC (selected from positions 1626-1638 In Figure 4H)» (Seq ID no 65) 

CAG CGA AGG TCC CAC (selected from positions 1651-1659 in Figure 4H), (Seq ID no 66) 

GTC ACC CCA TTG CTT (selected from positions 1675-1677 In Figure 4H)/ (Seq ID no 67) 

30 ATC GCT CTC TAC GGG (selected from positions 1 734-1 741 in Figure 4H), (Seq ID no 68) 

GTG TAT GTG CTC GCT (selected from positions 1 847-1 853 in Figure 41). (Seq ID no 69) 

ACG GTA TTC CGG GCC (selected from positions 1967-1976 in Figure 41). (Seq ID no 70) 

GGC CGA ATC CCG CTC (selected from positions 2006-2010 in Figure 41), (Seq ID no 71) 

AAA CAG TCG CTA CCC (selected from positions 2025-2027 in Figure 41), (Seq ID no 72) 

35 CCT TAC GGG TTA ACG (selected from positions 21 31-21 32 In Figure 4J), (Seq ID no 73) 

GAG ACA GTT GGG AAG (selected from positions 2252-2255 in Figure 4J), (Seq ID no 74) 
TGG CGT CTG TGC TTC (selected from positions 2396-2405 In 

Rgure 4J and Figure 4K), (Seq ID no 75) 

CGA CTC CAC ACA AAC (selected from positions 2416-2420 In Figure 4K). (Seq ID no 76) 

40 GAT AAG GGT TCG ACG (selected from positions 2474-2478 in Figure 4K). (Seq ID no 77) 

ATC CGT TGA GTG ACA (selected from position 2687 in Figure 4K), (Seq ID no 78) 

CAG CCC GTT ATC CCC (selected from position 2719 in Figure 4K). (Seq ID no 79) 
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AAC CTT TGG GAC CTG (selected from position 2809 in Figure 4L). (Seq ID no 80) 

TAA AAG GGt GAG AAA (selected frorii positions 3082-3068 in Figure 4L). (Seq ID no 81) 

GTC TGG CCT ATC AAT (selected from positions 3097-3106 In Figure 4L), (Seq ID no 82) 

5 Mycobacteria other ther\ those of the MTC group (16S) 

AGA TTG CCC ACG TGT (selected from positions 135-136 in Figure 5A). (Seq ID no 83) 

AAT CCG AGA AAA CCC (selected from positions 472-475 in Figure 5A), (Seq ID no 84) 

GCA TtA CCC GCT GGC (selected from positions 1136-1 144 in Figure 5A). (Seq ID no 85) 

TTA AAA GGA TTC GCT (selected from positions 1287-1292 in Figure 5B), (Seq ID no 86) 

10 AGA CCC CAATCC GAA (selected from position 1313 In Figure 5B). (Seq ID no 87) 

GAC TCC GAC TTC ATG (selected from position 1 334 In Figure 5B). (Seq ID no 88) 

Drug resistance 

23S-mecilated macrolide resistance (M. avians) 

1 5 GTC TTT TCG TCC TGC (wild-type) (selected from positions 2568-2569 

in Figure 6). (Seq ID no 89) 

GTC TTA TCG TCC TGC (selected from positions 2568 In Figure 6). (Seq ID no 90) 

GTC TTC TCG TCC TGC (selected from positions 2668 in Figure 6), (Seq ID no 91) 

GTC TTG TCG TCC TGC (selected from positions 2568 in Figure 6). (Seq ID no 92) 

20 GTC TAT TCG TCC TGC (selected from positions 2568 In Figure 6), (Seq ID no 93) 

GTC TOT TCG TCC TGC (selected from positions 2568 in Figure 6), (Seq ID no 94) 

GTC TGT TCG TCC TGC (selected from positions 2568 in Figure 6), (Seq ID no 95) 

16S'n)ediated streptomycin resistance (M, tubercutosis) 

25 TTG GCC GGT GCT TCT (wild-type) (selected from positions 452 in Figure 7), (Seq ID no 96) 

TTG GCC GGT ACT TCT (selected from positions 452 in Figure 7), (Seq ID no 97) 

TTG GCC GGT CCT TCT (selected from positions 452 in Figure 7). (Seq ID no 98) 

TTG GCC GGT TCT TCT (selected from positions 452 in Figure 7), (Seq ID no 99) 
ACC GCG GCT GCT GGC (wild-type) (selected ftXHn positions 473-477 

30 In Figure 7), (Seq ID no 100) 

ACC GCG GCT ACT GGC (selected from positions 473 in Figure 7). (Seq ID no 101) 

ACC GCG GCT CCT GGC (selected from positions 473 In Figure 7). or (Seq ID no 102) 

ACC GCG GCT TCT GGC (selected from positions.473 in Figure 7), (Seq ID no 1 03) 

CGG CAG CVG GCA CGT (selected from positions 474 in Rgure 7). (Seq ID no 104) 

35 CGG CCG CTG GCA CGT (selected from positions 474 in Figure 7). (Seq ID no 1 05) 

CGG CTG CTG GCA CGT (selected from positions 474 in Figure 7). (Seq ID no 106) 

CGT ATT ACC GCA GCT (selected from positions 477 in Figure 7), (Seq ID no 107) 

CGT ATT ACC GCC GCT (selected from positions 477 in Figure 7), (Seq ID no 108) 

CGT ATT ACC GCT GCT (selected from positions 477 in Figure 7), (Seq ID no 109) 

40 TTC CTT TGA GTT TTA (wild-type) (selected from positions 865-866 in Rgure 7). (Seq ID no 1 1 0) 

TTC CTT TAA GTT" TTA (selected from positions 865 in Figure 7). (Seq ID no 1 1 1) 

TTC CTT TCA GTT TTA (selected from positions 865 In Figure 7), (Seq ID no 1 12) 
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TTC CTT TTA GTT TTA (selected from positions 865 In Figure 7). 
TTC CTT AGA GTT TTA (selected from positions 866 in Figure 7), 
TTC CTT CGA GTT TTA (setected from positions 866 in Figure 7), 
TTC CTT GGA GTT TTA (selected from positions 866 in Figure 7). 



(SeqID no 113) 
(SeqID no 114) 
(Seq ID no 115) 
(Seq ID no 116) 



Other examples of suitable Q subsequences are given below. 



CAT GTG TCC TGT GGT and 
CGTCAGCCCGAGAAA 



(Seq ID no 117) 
(Seq ID no 118) 



10 



15 



20 



25 



30 



selected so as to be complementary to IW. gordonae 163 rRIMA (positions 174-188 and 452- 
466. respectively, of GenBank entry GB:MSGRR16SI. accession no. M29563). These 
positions conrespond to positions 192-206 and 473-487. respectively, of the alignments shown 
in Figure 2 and 5. Probes having this or a similar nudeobase sequence are suitable for 
detecting M. gordonae. 

CAC TAC ACA CGC TOG, and (Seq ID no 1 1 9) 

TGG CGT TGA GGT TTC (Seq ID no 120) 

selected so as to be complementary to positions 781-795 and 2369-2383, respectively, of M. 
icansasii 23S rRNA (GenBank entry MK23SRRNA accession number Z17212), These 
positions correspond to positions 774-794 and 2398-2412, respectively, of the alignments 
shown in Figure 1 and 4. Probes having this or a similar nudeobase sequence are suitable for 
detecting M. kansasii. 

Precursor rRNA 

AAC ACT CCC TTT GGA (Seq ID no 123) 

A peptide nucleic acid probe having the above-indicated nudeobase sequence is directed to 
M. tubierculosis precursor rRNA. The probe is complementary to positions 602 to 616 of 
GenBank accession number X58890. 

Especially, probes based oh those nudeobase sequences with sequence identification 
numbers Seq ID no 62, 79 and 80 (and other probes selected from positions 1361-1364 in 
Figure IF, 2719 in Figure 4K and 2809 in Figure 4L) are suitable for detecting M. avium. 
Probes based on the nudeobase sequence v^rith sequence identification number Seq ID no 55 
(and other probes selected from positions 763-789 in Figure 4E) are suitable for detecting M. 
avium, M. ihtracellulare and M. scrofulaceum as a group (the organisms tenned the IVJAIS 
group of mycobacteria). In addition, probes based on the nudeobase sequences with 
sequence identification numbers Seq ID no 77 and 81 are suitable for detecting M. avium. M. 
intracellulare and M. paratuberculosis as a group. 
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The inviention is further illustrated by the non-limiting examples given below. 

EXAMPLES 

EXAMPLE 1 

Mycobacterium species (M. bovis and M. intracellulare) 23S rDNA were partly amplified by 
PGR, and the PGR products were sequenced (both strands) using Gy5-labelled 
oligonucleotide primers (DMA Technology, Aarhus, Denmark) and the 7-dea2a-dGTP Thermo 
Sequenase cycle sequencing kit from Amersham, Little Ghalfbnt. England. Sequences were 
read using an ALFexpress automated sequencer and ALFwin (version 1.10) software from 
Pharmacia Biotech, Uppsala, Sweden. M. bovis and M. intracellulare 23S rRNA sequences 
are Included at the following positions of the 23S rDNA sequence alignments: positions 681- 
729 (Figures 1C arid 4D), positions 761-800 (Figures 1D and 4E), positions 2401-2440 
(Figures 1H and 4K). positions 2441-2480 (Figures 11 and 4K), positions 2481-2520 (Figure 
II), positions 3041-3080 (Figure 4L), and positions 3081-3120 (Figures 1J and 4L). 

EXAMPLE 2 

Sequence alignments (see Figures 1 to 5) of 23S, IBS and 5S rDNA of mycobacteria of the 
MTG group, arid 23S and 16S rDNA of mycobacteria other than those of the MTG group 
(MOtT) were done using the Megalign (version 3.12) alignment tool from DNASTAR 
(Madison, Wl, USA). Up to one hundred sequences were aligned at a time. 

Peptide nucleic acid probes in which the nucleobase sequence was complementary to 
distinctive mycobacterial rRNA were designed with due regard to secondary structures using 
the PrimerSelect program (version 3.04) from DNASTAR. As a control of sequence specificity, 
all probe sequences were subsequently matched with the GenBank and EMBL databases 
using BLAST sequence similarity searching at the National Center for Biotechnology 
Ihfonnation (http://www.ncbi.nlm.nih,gov). 

As examples, the following sequences were selected: 



MTG 23S 



TCACCA CCC TCCTCC 



(Seq ID no 6) 
(modified Seq ID no 6) 



CGACCCTCC TCC 



ACTATTCACACG CGC 



(Seq 10 no 8) 
(Seq ID no 12) 



CCACAC CCACCACAA 
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AACTCCACACCCCCG (SeqlDno16) 

ACT CCA CAC CCC CGA (Seq ro no 1 7) 

ACT CCG CCC CAA CTG (Seq id no 22) 

CTG TCC CTA AAC CCG (Seq ID no 23) 

TTC GAG GTT AGA TGC (Seq id no 24) 

GTCCCTAAACCCGAT (Seq ID no 25) 

GAC CTA TTG AAC CCG (Seq id no 29) 

MTC 16S 

GCA TCC CGT GGT CCT (Seq id no 33) 

CACAAGACATGCATC (Seq ID no 34) 

GGCTTTTAAGGATTC (Seq ID no 40) 

MdTT23S 

GAT CAA TGC TCG GTT (Seq ID no 44) 

CGA etc CAC ACA AAC (Seq id no 76) 

MOTT 16S 

GCATTACCCGCTGGC (S6qlDno8^ 
Drug resistance 

GTC TTA TCG TCC TiSC (Seq ID no 90) 

GTC TTC TCG TCC TGC (Seq ID no 91) 

GTC TTG TCG TCC TGC (Seq ID no 92) 

GTC TAT TCG TCC TGC (Seq id no 93) 

GTC TCT TCG TCC TGC (Seq |D no 94) 

GTC TGT TCG TCC TGC " (Seq id no 95) 

Pnicureor rRNA 

AACACTCCCTTTGGA (Seq ID no 123) 
Non-sense probes 

GTC CGT GAA CCC GAT (Seq ID no 121 ) 

TACGCTCTTTGAGCT (Seq ID no 122) 

EXAMPLE 3 



Peptide nucleic acid probes were synthesised using an Expedite 8ig09 Nudeic Acid Synthesis 
System purchased ftiom PerSeptive Biosystems (Framingham, USA). The peptide nucleic acid 
probes were temiinated with two p-alanine molecules or with one or two lysine molecule(s) 
and, before cleavage from the resin, labelled with 5-(or 6)-carboxyfluorescein (Flu) or 
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rhodamine (Rho) at the p-amino group of alanine (peptide label) or e-amino group of lysine 
(peptide label), respectively. Probes were purified using reverse phase HPLC at 50'C and 
characterised using a G2025 A MALDI-TOF MS instrument (Hewlett Packard. San Fernando, 
California. USA). Molecular weights deternriined were within 0,1% of the calculated molecular 
weights. 

The following labelled peptide nucleic acid probes were synthesised: 



MtC23S 

10 Lys(FIU)-Lys(Flu)-TCA CCA CCC TCC TCC-NHa 
Lys(Flu)-Lys(Flu)-CCA CCC TCC TCC-NHj 
Lys(Flu).Lys(Flu)-ACT ATT CAC ACG CGC-NHj 
Lys(FIu)-ACT ATT CAC ACG CGC-NHj 
Lys(Flu)-Lys(Flu)-CCA CAC CCA CCA CAA-NHa 

15 Lys{Flu)-Lys(Flu)-AAC TCC ACA CCC CCG-NHj 
Lys{Flu)-Lys(Flu)-ACT CCA CAC CCC CGA-NH2 
Lys(FIu)-Lys(Flu)-ACT CCG CCC CAA CTG-NHj 
Lys(Flu).Lys(Flu)-CTG TCC CtA AAC CCG-NHj 
Lys(Flu)-Lys(Flu)-TTC GAG GTT AGA TGC-NHj 

20 Lys(Ru)-tTC GAG GTT AGA TGC-NHj 

Lys(FIu)-Lys(Flu)-GTC CCT AAA CCC GAT-NHj 
Lys(Flu)-GTC CCT AAA CCC GAT-NHj 
Lys(Flu).G AC CTA TTG AAC CCG-NHj 



(OK 446/modified Seq ID no 6) 
(OK 575/modified Seq ID no 6) 
(OK 447/modified Seq ID no 8) 
(OK 688/modified Seq ID no 8) 
(OK448/modlfled Seq ID no 12) 
(OK 449/modified Seq ID no 16) 
(OK 309/modified Seq ID no 17) 
(OK 45a/modified Seq ID no 22) 
(OK 305/modified Seq ID no 23) 
(OK 306/modifled Seq ID no 24) 
(OK 682Attodified Seq ID no 24) 
(OK 307/nriodified Seq ID no 25) 
(OK 654/modified Seq ID no 25) 
(OK 660/modified Seq ID no 29) 



25 MTC16S 

Lys{Flu)-Lys(Flu)-Gly-GCATCCCGT GGT CCT-NHj 
Lys(Flu)-Lys(Flu)-CAC AAG ACA TGC ATC-NHj 
Lys(Flu)-CAC AAG ACA TGC ATC-NHj 
Lys(Flu)-GGC TTT TAA GGA TTC-NHj 
30 Lys(Rho)-GGC TTT TAA GGA TTC-NHa 



(OK 223/niodlfied Seq ID no 33) 
(OK 310/modified Seq ID no 34) 
(OK 655/modified Seq ID no 34) 
(OK 689/modlfled Seq ID no 40) 
(OK 702/modlfied Seq ID no 40) 
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MOTT23S 

FIu-p-Ala-p-Aia-GAT CAA TGC TCG GTT-NHj 
Flu-|J-Ala.|3-Ala-CGA CTC CAC ACA AAC-NHj 

MOTT16S 

Flu-p-Aia-p-Ala-GCA TTA CCC GOT GGC-NHa 



(OK624/modified Seq ID no 44) 
(OK612/modified Seq ID no 76) 



(OK 623/modified Seq ID no 85) 



40 



Drug resistance 

Lys(Flu)-GtC TTT TCG TCC TGC-NH2 
Lys(Rho)-GTC TTA TCG TCC TGCNHa 



(OK 745/modified Seq ID no 89) 
(OK 746/modified Seq ID no 90) 
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Lys(Rho)-GTC TTC TCG TCC TGC-NHj 
Lys(Rho)-GTC TTG TCG TCG TGC-NHj 
Lys(Rhoy-GTC TAt TCG TCC TGC-NHj 
Lys(Rho)-GTC TCT TCG TCC TGC-NHj 
5 Ly8(Rho).GTC TGT TCG TCC TGC-NHj 

Precursor rRNA 

Lys(Flu)-AAC ACT CCC TTT GGA-NHj 



(OK 746/modified Seq ID no 91) 
(OK 746/modified Seq ID no 92) 
(OK 747/modlfied Seq ID no 93) 
(OK 747/modlfied Seq ID no 94) 
(OK 747/rnodlfied Seq ID no 95) 



(OK 749/modlfied Seq ID no 123) 



1 0 Reduction of non-specific binding 

GTC CGT GAA CCC GAT-NHj 
Gly-TAC GCT CTT TGA GCT-NHj 

EXAMPLE 4 

15 

Initially the ability of the peptide nucleic acid probes to react with target sequences of 
mycobacterial rRNA was tested by dot blot earned out with rRNA from M: bovis BOG. M. 
avium and E.coli. 

20 M. bovis BCG {Statehs Serum Institut, Denmark) and M. Intracellulare (kindly provided by 

Statens Serum Institut) were grown in Dubos broth (Statens Serum Institut) or on Lbwenstein- 
Jensen slants (Statens Serum Institut) at 37 ""C. RNA was isolated from the bacterial cells 
using TRI-reagent (Sigma) following manufacture's directions. E. coll rRNA was purchased 
from Boehringer Mannheim, Germany. 

25 

200 ng M. bovis RNA, M. intracellulare RNA and E. coll rRNA were dotted onto membranes 
(Schleicher & SchQel. NY 1 3 N), and the membranes were dried and fixed under UV light for 2 
minutes. 

30 Protocol for dot blot assay 

Each of the pnDbes (70 nM probe in hybridisation solution (50 mM Tris, 10 mM NaCI, 10% 
(w/v) Dextran sulphate, 50% (v/v) glycerol. 5 mM EDTA, 0.1% (w/v) sodium pyrophosphate, 
0.2% (w/v) polyvinylpyrrolidone, 0.2% (w^) Ficoll, pH 7.6.)) were spotted onto a membrane. 
Hybridisation w^ continued for 1 .5 hours at 55 or 65 ''C. respectively. The membranes were 

35 rinsed 2 times for 15 minutes irt 2 x SSPE buffer (1 x SSPE: 0.15 M NaCI, 10 mlW sodium 
phosphate. 1 mM EDTA, pH 7.4) containing 0.1% SDS at ambient temperature, and 
subsequentiy 2 times for 16 minutes in 0.1 x SSPE buffer containing 0.1% SDS at 55 or 66 °C 
(see Table 1 ). The membrane was blocked with 0.6% (w/v) casein dissolved in 0.5M NaCI, 
0.05M Tris/IHCI pH 9.0. Thereafter, the membranes were incubated for 1 hour with rabbit-anti 



(OK 507/modifled Seq ID no 121) 
(OK 714/modified Seq ID no 122) 
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FITC antibody labelled with alkaline phosphatase (AP) (DAKO K0046 via! A) diluted 1:2000 in 
0.5% casein dissolved in 0.5M NaCI, O.OSIVI Tris/HCI pH 9.0. After incubation, the membranes 
were washed 3 times 5 minutes with TST buffer (0.05M Tris, 0.5M NaCI, 0.6% (w/v) Tween 
20®, pH 9) at ambient temperature. Bound probes were visualised following standard 
5 procedures using BCIP/NBT, and the visualisation was stopped by incubation for 1 0 minutes 
vtrith 10 mM EDTA. The blot was dried at 50 «*C. 

The results are given in Table 1 below. 

10 TABLE 1 





E. coli 


M. bdvis BCG 


M. intracelluiare 




rRNA 


RNA 


RNA 


Probe 


55 ''C 


65 «C 


55 °C 


65 


55 


65 


OK 305 


negative 


negative 


positive 


positive 


negative 


weak 


OK 307 


negative 


negative 


positive 


positive 


negative 


weak 


OK 309 


negative 


negative 


positive 


positive 


negative 


weak 


OK 223 


negative 


negative 


positive 


positive 


nd 


nd 


OK 310 


negative 


negative 


negative 


positive 


negative 


negative 



nd: Not detenntned 



The results indicate that all five peptide nucleic acid probes are capable of hybridising to target 
sequence of M. bovis BCG rRNA (as a representative of the IVITC group), whereas no 
1 5 hybridisation to E. coli rRNA (as a representative of organlsnris other than mycobacteria) and 
no detectable hybridisation to M. intracellularB rRNA were observed (as a representative of 
the MOTT group). 

EXAMPLE 5 

20 

This example illustrates the ability of the peptide nucleic acid probes to penetiBte the 
mycobacterial cell wall and subsequentiy hybridise to target sequence of mycobacteria of ttie 
MTC group and not mycobacteria of the MOTT group, In particular not mycobacteria of the 
MAC group, or Neisseria gonorriioeae, by fluorescence in situ hybridisation (FISH). 

25 

Preparation of bacterial slides 

M. bovis BCG (Statens SeruminstiM Denmark), M. avium (kindly provided by Statens 
Seruminstitut, Denmari<). and M. intracelluiare (kindly provided by Statens Seruminstitut, 
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Denmark) were grown In Dubos broth (Statens Seaimlnstitut, Denmark) or on LCwenstein- 
Jensen slants (Statens Seruminstitut, Denmark) at 37 ^'C. N. gonontoeae (Statens 
Serumlnstitut, Denmark) was grown on chocolate agar (Statens Seruminstitut. Denmark) at 37 
°C with additional 5% COj. 

5 

Cultures were smeared onto microscope slkies and fixed according to standard procedures. 
Prior to the hybridisation, the smears were immersed Into 80% ethanol for 15 minutes, and 
subsequently rinsed with water and air dried. This step is not essential for the following 
hybridisation step, but it is anticipated that It will kill any viable mycobacteria on the slides, and 
10 may further serve as an additional fixation step. 

Protocol for fluorescence in situ hybridisation (FISH) 

1 . The bacterial slide was covered with a hybridisation solution containing the probe In 
question. 

1 5 2. The slide was incubated in a humid incubation chamber at 45^C or 55°C for 90 
minutes. 

3. The slide was washed 25 minutes at 45°C or 65*^C in prewamied wash solution (5 mM 
Tris, 145 mM NaCI. pH 1 0) followed by 30 seconds in water. 

4. The slide was dried and mounted with IMAGEN Mounting Fluid (DAKO, Copenhagen, 
20 Denmari^) 

The hybridisation solution contains 50 mM Tris, 10 mM NaCI. 10% (w/v) Dextran sulphate. 
30% (v/v) fomiamide. 0.1% (v/v) Triton X-IOO®, 5 mM EDTA, 0.1% (w/v) sodium 
pyrophosphate. 0,2% (w/v) polyvinylpyrrolidone, 0.2% (w/v) Ficoll. pH 7.6, 

25 

Whenever possible, the applied equipment wias heat-treated, and solutions were exposed to 
1^l/ml diethylpyrocarbonate (Sigma Chemical Co.) in order to inactivate nucleases. 

Microscopically examinations were conducted using a fluorescence microscope (Leica, 
30 Wetzlar, Germany) equipped with a 100x/1.20 water objective, a HB0 100 W lamp and a 
FITC filter set Mycobacteria were identified as fiuorescent. 1 - 10 ^m slender, rod-shaped 
bacilli. 

Fluoresceln-labeiled peptide nucleic acid probes targeting 23S rRNA of the mycobacteria of 
35 th^ MTC group (OK 306, OK 309, OK 446, OK 449) and 16S riRNA of the mycobacteria of the 
MTC group (OK 310) were tested. Individual probe concentrations and incubation 
temperatures are listed together with the results in Table 2 and 3. 
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TABLE 2 





OK 306 


OK 309 


OK 446 


OK 449 




250nM 


25Qnh/\ 


SOOnM 


SOOhM 




45«C 


46X 


55*C 


55*^0 


M. bovis BCG 


positive 


positive 


positive 


positive 


M. avium 


negative 


negative 


negative 


negative 


M. intraoellulare 


negative 


negative 


not determined 


not determined 


N. gonorrhoeae 


negative 


negative 


not detemiined 


not determined 


TABLES 












OK 447 


OK 310 


OK 306/OK310 






IJIIVI 




500/50Qnl\/l 






55*»C 


45''C 


55*0 




M. bovis BCG 


positive 


positive 


positive 




M. avium 


negative 


negative 


negative 




M, intracellulare 


not determined 


negative 


negative 




N. gonorrhoeae 


not determined 


negative 


not determined 





5 It can be concluded that the probes are able to penetrate the mycobacterial cell wall of 
mycobacterium cultures and subsequently hybridise to target rRNA sequence. This maJ<es 
possible the development of fluorescence in situ hybridisation (FISH) protocols for specific 
detection of mycobacteria. 



10 EXAMPLES 

Test of probes on clinical smears of sputum 
. The ability of the peptide nucleic acid to penetrate the cell wall of mycobacteria of the MTC 
group In clinical samples was tested on smears of sputum from suspected cases of 
16 tuberculosis (kindly provided by Division of Microbiology, Ramathibodi Hospital, Bangkok, 
Thailand) by fluorescence in situ hybridisation (FISH). Smears from the same patient were 
initially evaluated positive by Ziehl-Neelsen staining, which shows only the presence of add 
fast bacilli, not whether these are mycobacteria of the MTC group. 

20 Fluorescein-labelled peptide nucleic acid probes targeting 235 rRNA of the mycobacteria of 
the MTC group (OK 306. OK 446. OK 449) and 163 rRNA of the mycobacteria of the MTC 
group (OK 310) were used. Furthennore, a random peptide nucleic acid probe (a 15-mer 
wherein each position may be A, T. C or G (obtained from Millipore Corporation, BedfbreJ. MA, 
USA) was added to the hybridisation solution in order to increase the signal-to-noise ratio. 
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FISH was carried out at 55 as described in Example 5. Applied probe concentrations are 
listed together with the results in Table 4 and 5. 



TABLE 4 



oarnpie 
niimhor 

1 lUIIIUd 


uiN 44o/Kanaoin 
iiiM/Rniiiui 


OK 449/Kanaom 


Zieni-Neelsen 
staining 




rOSlUVo 


Kosiuve 


4+ 


335 


Positive 


Eq. 


2* 


345 


Positive 


Positive 


3+ 


224 


Positive 


Positive 


3+ 


297 


Negative 


Eq. 


2+ 


179 


Negative 


Negative 


4+ 


247 


Negative 


Negative 


2+ 


255 


Positive 


Positive 


2+ 


202 


Eq. 


Positive 


2+ 



s 

TABLE 5 



Sample 
number 


OK 306/OK310 
500/500 nIVI 


Ziehi-Neelsen 
staining 


213 


Positive 


4+ 


292 


Positive 


4+ 


159 


Positive 


3+ 


287 


Positive 


3+ 



Smears stained by Ztehf-Neelsen staining were examined with a 100x objective and scored according 
to the following method: 0 bacilii. +/-: 1-200 per 300 fields, 2+: 1-9 per 10 fields, 3+: 1-9 per field. 4+: 
>9 per Held. 

10 Positive: Several mycobacteria were identified in the smear. Negative: No fluorescent mycobacteria 
were identified in the smear. Eq: Few (1-3) fluorescent mycobacteria were identified in the smear. 

It appears from the table that the peptide nucleic acid probes are able to penetrate and 
subsequently hybridise to target sequence of mycobacteria of the MTC-group in AFB-positive 
15 sputum smears. The fact that not all AFB-positive sputum smears are found positive with 

applied probes indicate that not ail AFB-positive sputum smears contains mycobacteria of the 
MTC-group. 

EXAMPLE? 

20 

The reactivity and specificity of selected peptide nucleic acid probes for detecting 
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mycobacteria of the MTC group as well as probes for detecting fnycobacteria of the MOTT 
group were evaluated by fluorescence in situ hybridisation (FISH) on control smears prepared 
from cultures of different mycobacterium species. The mycobacteriurh species were selected 
so as to be representative for the mycobacterium genus as well as to Include clinically relevant 
5 species. 

M. tuberculosis (ATCC 25177). M. bovis BCG (ATCC 35734). M. intracellulare (ATCC 13950), 
M. avium (ATCC 25292), M. kansasil (ATCC12479), M. gordonae (ATCC 14470). M. 
scrofulaceum (ATCC 19981). M. absoessus (ATCC19977). M. marinum (ATCC 927), M. 
10 simiae (ATCC 25575). M. szulgai (ATCC 35799). M. flavescens (ATCC 23033), M. fbrtuitum 
(ATCC 43266) and M. xenopi (ATCC19250) were grown at Dubos broth (Statens Serum 
Institut) at 37 ''C with the exception of M. marinum which was grown at 32 ''C. 

Smears were prepared as described in Example 5. FISH was carried out as described below. 

15 

Protocol for fluorescence in situ hybridisation (FISH) 

1 . The bacterial slide was covered with a hybridisation solution containing the probe in 
question. 

2. The slide was incubated in a humid incubation chamber at 55''C for 90 minutes. 

2D 3. The stilde was washed 30 minutes at 55X in prewanned wash solution (5 mM Tris, 1 5 
mM NaCI, 0.1% (v/v). Triton X-100* pH 10) followed by 30 seconds in water. 
4. The slide was dried and mounted with IMAGEN Mounting Fluid (DAKO, Copenhagen. 
Denmark) 

25 The hybridisation solution contained 50 mM Tris, 10 mM NaCI, 10% (w/v) Dextran sulphate. 
30% (v/v) formaniiide, 0.1% (v/v) Triton X-100® 5 mM EDTA, 0.1% (w/v) sodium 
pyrophosphate. 0.2% (w/v) polyvinylpynx)lidone, and 0.2% (w/v) Ficoll, pH 7.6. To avoid non- 
specific binding of the labelled peptide nucleic acid probe, 1-5 ^M of non-labelled, non-sense 
peptide nucieto acid probe was added to the hybridisation solution (OK 507/modified Seq ID 

30 no 121 and/or OK 714/nfK>dified Seq ID no 122). 

Whenever possible, the applied equipment was heat-treated, and solutions were exposed to 
l^t/ml diethylpyrocarbonate (Sigma Chemical Co.) in order to inactivate nucleases. 

35 Microscopic examinations were conducted using a fluorescence microscope (Leica, Wetzlar, 
Germany) equipped with a 100x/1.30 oil objective, a HB0 100 W lamp and a FITC/tRITC dual 
band filter set Mycobacteria were identified on basis of both fluorescence (strong, medium, 
weak, no) and morphology (1-10 \im slender, rod-shaped bacilli. Mycobacteria of the MOTT 
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group may appter pleomorphic, ranging in appearance from long rods to coccoid foms) 



Probe concentrations are listed together with the results in Table 6 and 7 (probes targeting 
mycobacteria of the MTC group) and Table 8 (probes targeting to mycobacteria of the MOTT 
group). 



TABLES 





OK 450 
25 nM 


OK 682 
100 nM 


OK 689 

100 nM 


OK 688 

250 nM 


OK 660 
100 nM 


M. tuberculosis 


+++ 


+++ 


+++ 


+++ 


+++ 


IVI. uUVIS DV^o 


+++ 


+++ 


+++ 






M. intraceiluiare 












M. avium 












M. kansasii 


++ 










1^. gordonae 












M. scrofulaC0um 


+++ 










M. abscessus 


,;, 








+ 


M. marinum 






+ 


+ 


+++ 


M. simiae 












M. sizulgai 


+++ 










M. flavescens 




++ 








M. fortuitufti 




+ 








M. xenopi 













strong fluorescence. noedium fluorescence, + weak fluorescence, - no fluorescence 
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TABLE 7 



Mycobacteria 


OK 655 
150 nM 


OK 448 
50 nM 


OK 654 
100 nM 


OK 446 
25 nM 


M. tuberculosis 


+++ 


+++ 






M. bovis BCG 


+++ 


+++ 






M. intracellulare 










M. avium 










M. kansasii 










M. gordonae 










M. scrofiiiaceum 










M. abscessus 






+ 




M. marinum 






+ 


+++ 


M. simiae 










M. szuljgai 










M. flavescens 










M. fortultum 










M. xenopi 











+++ strong fluorescence, ++ medium fluorescence, + weak fluorescence, - no fluorescence 
TABLE 8 



Mycobacteria 


OK 612 
100 nM 


OK 624 
100 nM 


OK 623 
100 nM 


M. tuberculosis 








M. bovis BCG 








M. intracellulare 




++ 


++ 


M. avium 


+++ 


+++ 


+++ 


M. Icansasii 






+++ 


M. gordonae 




++ 


++ 


M. scrofulaceum 




++ 


++ 


M. abscessus 




++ 


+++ 


M. marinum 








M, simiae 


- 


++ 


+++ 


M. szulgai 






+++ 


M. flavescens 








M. fortuilum 




++ 




M. xenopi 









-H>+ strong fluorescence, ^* medium fluorescence, weak fluorescence, - no fluorescence 

Each of probes indicated in Table 6, 7 and 8 was further investigated with regard to 
hybridisation to other common respiratory bacteria, namely Corynebacterium spp., 
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Fusobacterium nucleatum, Haemophilus influenzae. Klebsiella pneumoniae, Pseudomonas 
aeruginosa, Propionlbacterium acnes, Streptococcuc pneumoniae, Staphylococcus aureus. 
Bfahamella catarrahalls, Escherichia coll, Neisseria spp.. Actinobacter calcoaceticus, 
Actinoitiyces spp., Enterobacter aerogenes, Proteus mirabilis. Pseudomonas maltophiiia. 
5 Streptocussuc viridans, and Norcardia asteroides. No cross-hybridisation was observed by 
fluorescence in situ hybridisation to any of these bacteria in the case of OK 682, OK 654, OK 
665, OK 688. OK 660. OK 612. OK 624 and OK 623. Some cross-reactivity was obsen^ed in 
the case of OK 446 (to P. acnes), OK 448 (to P. acnes and B. catarrhalis), and OK 450 (to P. 
acnes and B. catarrhalis). 

10 

Table 6 and 7 shows that none of the MTC probes cross-react with M. intracellulare and/or M. 
avium, but indeed strongly with M, tuberculosis and M. bovis BCG. As shown in Table 8. both 
OK 624 and OK 623 hybridise to M. intracellulare and M. avium which are both members of 
the MAC group, whereas none of them hybridise to M. tuberculosis or M. bovis BCG. OK 612 
15 hybridises to M. avium only. It should be noted that the aligned sequence of M. intracellulare 
has Just one nucleobase difference to the target sequence of M. avium, see Figure 4K. 

The data support the use of the methodology described in claim 3 and 4 and exemplified in 
Example 2 for design of peptide nucleic acid probes that are capable of hybridising to target 
20 sequence of one or more mycobacterium species and not to omer mycobacterium species 
having at least one nucleobase difference to the target sequence. 

EXAMPLES 

25 To study the usefulness of the peptide nucleic acid probes in distinguishing between 

mycobacteria of the MTC group and mycobacteria of the MOTT group, the probes were tested 
on smears of mycobacterium-positive cultures prepared ftx)m 34 + 28 clinical samples (sputum 
samples, otiier respiratory samples and extrapulmonary samples) from individuals suspected 
of tuberculosis or other mycobacterial infections (kindly provided by the Mycobacterium 

30 Department. Statens Serum Institut, Denmark). Complex/species identification data obtained 
with the AccuProbe tests from Gen-Probe Inc., USA were available for each sample. 

Table 9 shows the results obtained with four different peptide nucleic acid probes targeting 
mycobacteria of the MTC group (OK 682, OK 660, OK 688 and OK 689) and one probe 
35 targeting mycobacteria of the MOTT group (OK 623), and Table 1 0 shows the results obtained 
with two peptide nucleic acid probes targeting mycobacteria of the MOTT group (OK 623 and 
OK 612) and a mixture of two probes targeting mycobacteria of the MTC group (OK 688 and 
OK 689). Data. are airanged according to the results obtained by AccuProba Sample 
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preparation, hybridisation and vtsualisaition were performed as descnted in Example 7. 



TABLE 9 



Complex/ 


OK 623 


OK 682 


OK 660 


OK 688 


OK 689 


species (n) 


25 nM 


100 nM 


100 nM 


250 nM 


100 nM 




np 




np 


Op 


np 


MTC (23) 


0 


23 


23 


23 


23 


M. avium (5) 


5 


0 


0 


0 


0 


M. gordonae (3) 


3 


0 


0 


0 


0 


Unknown' (3) 


3 


0 


0 


0 


0 



np denotes number of posKive samples. 

The tenri 'unknown" means that the sample not contains mycobacteria of the MTC group, or 
mycobacteria of the MAC group according the AccuProbe test, but further species identification 
was not performed. 



TABLE 10 



Complex/ 


1 OK 623 


OK 612 


OK688/OK 689 


spedes (n) 


25nM 


100 nM 


50 nM/50 nM 




1 


np 




MTC (17) 


0 




16 


M. avium (2) 


2 


2 


0 


M. gordonae (4) 


3 


0 


0 


Unknown' (5) 


5 


0 


0 



np denotes number of positive samples. 

The term "unknown" means that the sample not contains mycobacteria of the MTC group, or 
mycobacteria of the MAC group according to the AccuProbe test, but further species identification 
was not performed. 

The results shown in Table 9 are in conformity with the complex/species identification 
performed with the AccuProbe tests, and thus confirm that peptide nucleic acid probes can be 
used to determine whether an infection is caused by mycobacteria of the MTC group or by 
mycobacteria of the MOTT group. 

From the results in Table 10. it can be seen that it is possible to differentiate between 
mycobacteria of the MTC group and mycobacteria of the IVIOTT group with 1 00% specificity 
and 9^-94% sensitivity relative to results obtained by the AccuProbe tests. Furthermore, OK 
612 is very suitable for specific Identification of M. avium among those being positive for 
nnycobacteria of the MOTT group as the result Is positive in the case of M. avium and negative 
in the other cases of mycobacteria of the MOTT group. 
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EXAMPLE 9 

Direct d0t0<^on of mycobacteria in clinical smears of sputum 
This example demonstrates the ability of the peptide nucleic acid to detect and identify 
5 mycobacteria directly In AFB-posltive sputum samples from suspected cases of tuberculosis 
(kindly provided by Division of Microbiology, Ramathibodi Hospital, Bangkok, Thailand) and 
suspected cases of other mycobacterial infections (kindly provided by Clinical Microbiology 
Dept. Rigshospitalet, Copenhagen. Denmaric) by FISH is shown. 

10 The clinical smears were prepared according to the procedure described In Example 5, and 
FISH was performed as described in Example 7. The results are shown in Table 11. 



TABLE 11 



Sampiia no. 


OK 623 
25 nM 


OK 654 
100 nM 


OK655 
150 nM 


OK682 
100 nM 


OK 688 
250 nM 


OK 689 
100 nM 


1 




++ 


++ 


++ 


++ 


++ 


175 




++ 


nd 


nd 


++ 


++ 


469 






nd 


nd 






166 








nd 






268 




-H- 




++ 


++ 


•H- 


34267 


++ 













nd: not determined 



15 +++ strong fluorescence, ++ medium fluorescence, + weak fluorescence, - no fluorescence 

It appears from examples in Table 1 1 that AFB-positive sputum smears were evaluated 
positive for mycobacteria of the MTC group (sample numbers 1 , 175, and 268), positive for 
mycobacteria of the MOTT group (sample number 37267), or negative for mycobacteria 

20 (sample numbers. 459 and 166) by the applied probes. Thus, PNA-probes are useful reagents 
for specific identification of mycobacteria directly in sputum smears by fluorescence in situ 
hybridisation. AFB-positive sputum samples that are negative with all probes may be 
explained in three ways: a) the sample may contain mycobacteria not detected by the probes, 
e.g. M. fbrtuitum, b) the sample may contain other add-feist bacteria than mycobacteria, or c) 

25 the mycobaicteria in the sample lack or have a strongly reduced content of rRNA due to for 
example antibiotic treatment 

In conclusion, direct identification of mycobacteria In smear-positive sputum samples by 
peptide nucleic acid-based fluorescence in situ hybridisation combines simplicity and 
30 morphological advantages of current staining methods with concomlnant species 
identification, and will thus allow clinical microbiology laboratories to benefit ftx>m the 
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advantages offered by molecular techniques to provide crucial information pertaining to 
therapy and patient management 

EXAIVIPLE10 

5 

This example demonstrates simultaneous detection and identification of mycobacteria of the 
MTC group and mycobacteria of the MOTT group using differently labelled probes targeting 
mycobacteria of the MTC group and nnycotacteria of the MOTT group, respectively, by 
fluorescence in situ hybridisation. 

10 

Control smeairs of different mycobacterium species were prepared as described in Example 5. 
In addition, smears containing a mixture of M. tuberculosis and M. avium were prepared 
(Table 8. last row). FISH was performed as described in Example 7. 

15 A rhodamlne-labelled peptide nucleic acid probe targeting 16S rRNA of mycobacteria of the 
MTC group (OK 702) and a fluorescein-labelled peptide nucleic acid probe targeting 16S 
rRNA of mycobacteria of the MOTT group (OK 623) were applied simultaneously In the 
concentrations listed in Table 12 together with the results. 

20 TABLE 12 



Mycobacterium species 


OK623/OK702 
25/250 nM 


M. tuberculosis 


-(Gy+++(R) 


M. bovis BCG 


-(G)/ +++ (R) 


M. avium 


+++(G)/-(R) 


M. intracellularie 


+++(G)/-(R) 


M. itansasii 


+++(G)/-(R) 


M. avium / tuberculosis 


+++ (G)/+++ (R) 



+++ strong fluorescence - no fluorescence 
G green fluorescence, R red fluorescence 



Mycobacteria of the MTC group, i.e. M. tuberculosis and M. bovis. were observed as green 
25 fluorescent mycobacteria, whereas mycobacteria of the MOTT group, i.e. M. avium, M. 

intracellulare and M. kansasli, were observed as red fluorescent mycobacteria. Mycobacteria 
in the M. avium/M. tuberculosis mb(ture were Identified by a mixture of both green fluorescent 
mycobacteria and red fluorescent mycobacteria. 



30 The results show that it Is possible to distinguish between different Mycobacterium species in 
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one smear using a mixture of differently labelled probes. Such simultaneous detection and 
identification of mycobacteria may further be extended to comprise three or more differently 
labelled peptide nucleic acid probes. 

5 EXAMPLE 11 

The ability of a peptide nucleic acid probes to hybridise to precursor rRNA and further to 
distinguish be^A/een precursor rRNA of M. tuberculosis and precursor rRNA of M. avium was 
investigated by fluorescence in situ hybridisation. 

10 

Smears were prepared as described in Example 5 and FISH were carried out as described in 
Example 7 using a fluorescein-labelled probe targeting precursor rRNA of M. tuberculosis (OK 
749). The results are given in Table 13. 

15 TABLE 13 



Mycobacterium 


OK 749 
1000 nM 


M. tuberculosis 


+ 


M. avium 





+ weak fluorescence - no fluorescence 



From the results, it can be concluded that it is possible to detect precursor rRNA. and further 
that is possible to distinguish between precursor rRNA from different mycobacterium species. 
20 The application of peptide nucleic acid targeting precursor rRNA may be particularly useiful for 
measuring the mycobacterial growth and thus be an indicator of the viability of the 
nriycobacteria. This would in particular be important for monitoring of the effect of antibiotics in 
relation to both treatment oif tuberculosis and drug susceptibility studies. 

25 EXAMPLE 12 

The ability of peptide nucleic acid probes for differentiation of drug susceptible and drug 
resistant mycobacteria was evaluated using a fluorescein-labeiled probe targeting the wild 
type sequence of 23S rRNA of M. avium and M. intracellulare togetiier witti riiodamine- 
30 labelled probes targeting single point mutations associated witii macrolide resistance in M. 
avium and M. intracellulare. 



Smears were prepared as described in Example 5 from cultures of M. avium (ATCC no. 
25292) and M. intracellulare (ATCC no. 13950). These strains are anticipated to contain the 
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wild type sequence of rRNA. Miacrollde resistant variants were not available. FISH was carried 
out as described in Example 7 using a fluoresceln-labelled peptide nucleic add probe 
targeting wild type 23S rRNA (OK 745) and a mixture of rtiodamine-labelled peptide nucleic 
acid probes targeting the three possible mutations at position 2568 (OK 746) and at position 
5 2569 (OK 747) of M. avium 23S ri3NA of GenBank entry X5291 7 (see Figure 6). The results 
are given in Table 14. 



TABLE 14 



IVIycobacterium spedes 


OK745/OK746/OK747 

500/500/500 nM 


M. avium (wild type) 


(G)/.(R) 


M. intracellulare (wild type) 


+++(G)/.(R) 



+++ strong fluorescence - no fluorescence 
10 G green fluorescence. R red fluorescence 

OK 746 and OK 747 are each a mixture of three single point mutation probes 

The results in Table 14 show that M. avium and M. intracellulars are detected with the 
fluoresceln-labelled probe (OK 745) targeting M. avium and M. intracellulare wild types and 
15 not detected wBh the mixture of rhodamine-labelled probes (OK 746 and OK 747) targeting 
single point mutations assodated with macrolide resistance. Such peptide nudeic acid probes 
targeting the wild type and drug resistant variants, respectively, may be important tools for 
both the prediction of an efficient therapy as wiell as for monitoring the effect of the treatment. 

20 

EXAMPLE 13 

To illustifatd the speed with which peptide nucleic add probes penetrate the mycobacterial cell 
wall and subsequently hybridise to their target sequence the protocol described in Example 7 
25 wats modified to 15 minutes hybridisation time and the results compared with 90 minutes 

hybridisation time. Smears were prepared as described in Example 5. The results are given in 
Table 15. 
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TABLE 15 





OK 623 
25 nM 


OK 689 
100 nM 




15 min 


90 min 


15 min 


90 min 


M. tuberculosis 






++ 


+++ 


M. avium 


++ 


+++ 







+++ strong fluorescence ++ medium fluorescence 
+ weal< fluorescence - no fluorescence 



The data presented in Table 15 sifiow that hybridisation by peptide nucleic add probes inside 
the mycobacterial cells is accomplished in a very short time resulting in a detectable signal 
after just 15 minutes incubation. Thus, the use peptide nucleic acid probes makes possible the 
development of very fast fluorescence in situ hybridisation protocols. 

EXAMPLE 14 

To describe the ability of very short peptide nucleic acid probes to hybridise to farget 
sequences, a 12-mer peptide nucleic acid probe labelled with fluorescein (OK 575) was tested 
by fluorescence in situ hybridisation (FISIH). 

Smears were prepared as described in Example 5 and FISH were earned out as described in 
ExaiTiple 7. The results are given in Table 16. 



TABLE 16 



Mycobacterium 


OK 575 




50 nM 


M. tuberculosis 


+ 


M. bovis BOG 


++ 


M. avium 




M. intracellulare 




M. kansasii 





•^-t^ medium fluorescence + weak fluorescence - no fluorescence 

The results in table 17 shows that a 12-mer peptide nudeic acid probe is capable of 
hybridising speciflcally to target sequences under the same stringency conditions as 15-mers. 
A lower florescence intensity Is obtained as the T„, for a 12-mer peptide nucleic add probe is 
lower than T,„ for a 15-mer peptide nudeic acid probe. 
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The data clearly suggest that by lowering the stringency condition, e.g. by decreasing the 
hybridisation/washing temperature and/or the concentration of fbrmamide. even shorter 
probes may be applied for detection of mycobacteria provided that specific sequences of such 
can be designed. 
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CLAIMS 



1. Peptide nucleic acid probe for detecting a target sequence of one or more mycobacteria 
optionally present in a sample, said probe being capable of hybridising to a target sequence of 

5 mycobacterial rDNA, precursor rRNA 6r rRNA forming detectable hybrids, 
and a mixture of such probes. 

2. Peptide nucleic acid probe according to claim t said probe being capable of hybridising to 
a target selquence of mycobacterial rDNA, precursor rRNA. or 23S, 16S or 5S rRNA forming 

10 detectable hybrids, 

and a mixture of such probes. 

3. Peptide nucleic acid probe according to claim 1 or 2. said probe being capable of 
hybridising to a target sequence of mycobacterial rDNA, precursor rRNA. or 23S, 16S or 5S 

15 rRNA forming detectable hybrids, said target sequence being obtainable by 

(a) comparing the nucleobase sequences of said mycobacterial rRNA or rDNA of one or more 
mycobacteria to be detected with the conresponding nucleobase sequence of organlsm(s). in 
particular other mycobacteria, in particular other mycobacteria, from which said one or nru>re 

20 mycobacteria are to be distinguished. 

(b) selecting a target sequence of said rRNA or rDNA which includes at least one nucleobase 
differing from the corresponding nucleobase of the organism(s), in particular other 
mycobacteria, from which said one or more mycobacteria are to be distinguished, and 

25 

(c) determining the capability of said probe to hybridise to the selected target sequence to 
form detectable hybrids, 

and a mixture of such probes. 

30 4. Peptide nucleic acid probe according to claim 1 or 2, said probe being capable of 

hybridising to a target sequence of mycobacterial rDNA, precursor rRNA or 23S, 16S or 5S 
rRNA forming detectable hybrids, said probe being obtainable by 

(a) comparing the nucleobase sequences of said nnycobacterial rRNA or rDNA of one or more 
35 mycobacteria to be detected with the con-esponding nucleobase sequence of organism(s), in 
particular other mycobacteria, in particular other mycobacteria, from which said one or more 
mycobacteria are to be distinguished. 
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(b) selecting a target sequence of said rRNA or rDNA which includes at least one nucleobase 
differing from the conresponding nucleobase of the organism(s), In particular other 
mycobacteria, from which said one or more mycobacteria are to be distinguished. 

5 (c) synthesising said probe, and 

(d) determining the capability of said probe to hybridise to the selected target sequence to 
fonti detectable hybrids, 
and a mbcture of such probes. 

10 

5. Peptide nucleic acid probe according to any one of claims 1 to 4 for detecting a target 
sequence of one or more mycobacteria of the Mycobacterium tuberculosis Complex (MTC) or 
for detecting a target sequence of one or more mycobacteria other than mycobacteria of the 
Mycobacterium tuberculosis Complex (MOTT) optionally present in a sample, which probe 
15 ' comprises from 6 to 30 polymerised peptide nucleic acid moieties, said probe being capable of 
hybridising to a target sequence of mycobacterial rDNA, precursor rRNA or 23S, 16S or 5S 
rRNA fonnihg detectable hybrids, 
and a mixture of such probes. 

20 6. Peptide nucleic acid probe according to any one of claims 1 to 5 for detecting a target 
. sequence of rDNA, precursor rRNA or 23S, 16S or 5S rRNA of one or more hriycobacteria of 
the Mycobacterium tuberculosis Complex (MTC) or for detecting a target sequence of rDNA, 
precursor rRNA or 23S. 16S or 5S rRNA of one or more mycobacteria other than 
mycobacteria of the Mycobacterium tuberculosis Complex (MOTT) optionally present in a 

25 sample, which probe comprises from 10 to 30 polymerised moieties of formula (I) 



wherein each X and Y independently designate 0 or S, 

each Z independently designates O, S, NR\ or C(R^)2, wherein each R^ independently 
designate H, C^^ alkyl. C,^ alkenyl, C,^ alkynyl, 
35 each R^, R^ and R^ designate independently H, the side chain of a naturally occurring amino 
acid, the side chain of a non-naturally occurring amino acid, Cm al^^li ^^•4 alkenyl or Ci^ 
alkynyl, or a functional group, each Q independently designates a naturally occurring 
nucleobase, a non-naturally occurring nucleobase, an intercalator, a hucieobase-binding 



30 




(I) 
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group, a label or H, 

with the proviso that the probe comprising such subsequence is capable of forming detectable 
hybrids with the target sequence of said mycobacterial rDNA, precursor rRlsiA or 23S, 16S or 
5 SSrRNA. 

and a mixture of such probes. 

7. Peptide nucleic acid probe according to any one of claims 1 to 6 for detecting a target 
sequence of 23S rRNA of one or more mycobacteria of the Mycobacterium tuberculosis 
10 Complex (MTC) optionally presient in a sample, which probe comprises from 10 to 30 
polymerised moieties of fbmnula (I) as defined In claim 6, 

with the proviso that the Qs of adjacent moieties are selected so as to forni a sequence of 
which a subsequence includes at least one hucleobase that is complementary to a 
15 nucleobase of M. tuberculosis 23S rRNA differing from the corresponding nucleobase of at 
least M. avium located within the following domains 

Positions i49-15iB in Figure 1A, 

Positions 220-221 in Figure 1 A, 
20 Positions 328-361 in Figure 1 A and Figure 1 B, 

Positions 453-455 in Figure IB, 

Positions 490-501 in Figure 1B. 

Positions 637-660 in Figure 1C. 

Positions 706-712 in Figure ID. 
25 Positions 762-789 in Figure 1 D. 

Position 989 in Figure 1 D, 

Positions 1068-1072 in Figure 1D, 

Position 1148 in Figure IE, 

Positions 1311-1329 in Figure IE, 
30 Positions 1361-1364 In Figure IF, 

Position 1418 In Figure IF, 

Positions 1563-1570 in Figure IF, 

Positions 1627-1638 in Figure 1G. 

Positions 1675-1677 in Figure 1G, 
36 Position 1718 in Figure 1G, 

Positions 1 734-1 740 In Figure 1 H. 

Positions 1967-1976 in Figure 1H, 

Positions 2403-2420.in Figure 1H, 
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Positions 2457-2488 in Figure 11, 
Positions 2952-2956 in Figure 11, 
Positions 2966-2969 in Figure 1J, 
Positions 3000-3003 in Figure 1J or 
5 Positions 3097-31 06 in Figure 1 J. 

and further with the proviso that the probe comprising such subsequence is capable of fbnming 
detectable hybrids with a target sequence of said mycobacterial 23S rRNA, 
and a mixture of such probes. 

10 

8. Peptide nucleic acid probe according to any one of claims 1 to 6 for detecting a target 
sequence of 16S rRNA of one or more mycobacteria of the Mycobacterium tuberculosis 
Complex (MTC) optionally present in a sample, which probe comprises from 10 to 30 
polymerised moieties of formula (I) as defined in claim 6, 

15 

with the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 
which a subsequence includes at least one nucleobase that is complementeiry to a 
nucleobase of M. tuberculosis 16S rRNA differing from the corresponding nucleobase of at 
least M. avium located within the following domains 

20 

Positions 76-79 In Figure 2A, 

Positions 98-101 in Figure 2A, 

Positions 135-136 in Figure 2 A, 

Positions 194-201 in Figure 2B. 
26 Positions 222-229 in Figure 2B, 

Position 242 in Figure 2B, 

Position 474 in Figure 2C, 

Positions 1 1 36-1 1 45 in Figure 2C, 

Positions 1271-1272 in Figure 2C. 
30 Positions 1287-1292 In Figure 20. 

Position 1313 in Figure 2D, or 

Position 1334 In Figure 2D, 

and further with the proviso that the probe comprising such subsequence is capable of fbrnting 
35 detectable hybrids with a target sequence of said mycobacterial 16S rRNA, 
and a mixture of such probes. 



9. Peptide nucleic acid probe according to any one of claims 1 to 6 for detecting a target 
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sequence of 5S rRNA of one or more mycobacteria of the Mycobacterium tuberculosis 
Complex (MTC) optionally present in a sample, which probe comprises from 10 to 30 
polymerised moieties of formula (I) as defined in claim 6, 

6 with the proviso that the Qs of adjacent nrwieties are selected so as to form a sequence of 
which a subsequence indudes at least one nudeobase that is complementary to a 
nucleobase of M. tuberculosis 5S rRNA differing from the corresponding nudeobase of at 
least M. avium located within the following domain 

10 Positions 86-90 in Figure 3 

and further with the proviso that the probe comprising such subsequence Is capable of forming 
detectable hybrids with a target sequence of said mycobacterial 5S rRNA, 
and a mixture of such probes. 

15 

10. Peptide nudeic add probe according to any one of claims 1 to 8 for detecting a target 
sequence of 23S or 1 6S rRNA of one or more mycobacteria of the Mycobacteriurri 
tuberculosis Complex (MTC) optionally present in a sample, which probe comprises from 10 to 
30 polymerised moieties of formula (I) as defined in claim 6. 

20 

with the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 
which a subsequence includes at least one nucleobase that is complemeritary to a 
nudeobase of M. tuberculosis 23S or 16 S rRNA differing from the conBsponding nucleobase 
of at least M. avium located within the following domains 

25 

Positions 149-158 in Figure 1A. 

Positions 328-361 in Figure lAand Figure IB. 

Positions 490-501 in Figure IB. 

Positions 637-660 in Figure 1C, 
30 Positions 762-789 in Figure 1 D. 

Positions 1068-1072 in Figure 10, 

Positions 1311-1329 in Figure IE, 

Positions 1361-1364 in Figure IF, 

Positions 1563-1570 in Figure 1F, 
36 Positions 1627-1638 in Figure 1G, 

Positions 1734-1740 in Figure 1H, 

Positions 2457-2488 in Figure 11, 

Positions 2952-2956 Jn Figure II. 
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Positions 3097-3106 in Figure 1 J, 
Positions 135-136 in Figure 2 A, or 
Positions 1287-1292 In Figure 2D, 

5 and further with the proviso that the probe comprising such subsequence is capable of forming 
detectable hybrids with a target sequence of said mycobacterial 23S or 16S rRNA, 
and a mixture of such probes. 

1 1 . Peptide nudeic acid probe according to any one of claims 1 to 6 for detecting a target 
10 sequence of 23S rRNA of one or more mycobacteria other than mycobacteria of the 

Mycobacterium tuberculosis Complex (MOTT) optionally present in a sample, which probe 
comprises from 10 to 30 polymerised moieties of formula (1) as defined in claim 6, 

with the proviso that the Qs of adjacent moieties are sielected so as to form a sequence of 
15 which a subsequence includes at least one nucleobase that is complementary to a 

nudeobase of M. avium 23S rf^NA differing from the corresponding nudeobase of at least M. 
tuberculosis located within the following domains 

Positions 99-101 in Figure 4A. 
20 Position 183 in Figure 4A. 

Positions 261-271 in Figure 4A, 

Positions 281-284 in Figure 4B, 

Positions 290-293 in Figure 4B, 

Positions 327-335 in Figure 4B, 
26 Positions 343-357 in Figure 48, 

Positions 400-405 in Figure 48 and Figure 4C, 

Positions 453-462 in Figure 4C, 

Positions 587-599 in Figure 4C, 

Positions 637-660 in Figure 4D, 
30 Positions 704-712 in Figure 4D, 

Positions 763-789 in Figure 4E. 

Positions 1060-1074 in Figure 4E, 

Positions 1 1 77-1 1 86 in Figure 4E, 

Positions 1259-1265 in Figure 4F, 
35 Positions 1311-1327 in Figure 4F, 

Positions 1345-1348 in Figure 4F, 

Positions 1361-1364 in Figure 4G. 

Positions 1556-1570 in Figure 4G. 
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Positions 1608-1613 in Figure 4H, 

Positions 1 626-1 638 in Figure 4H, 

Positions 1651-1669 in Figure 4H, 

Positions 1675-1677 in Figure 4H, 
5 Positions 1734-1741 in Figure 4H, 

Positions 1847-1853 in Figure 41. 

Positions 1967-1976 in Figure 41, 

Positions 2006-2010 in Figure 41. 

Positions 2025-2027 in Rgure 41. 
10 Positions 2131-2132 in Figure 4J, 

Positions 2252-2255 in Figure 4J. 

Positions 2396-2405 in Figure 4J and Figure 4K, 

Positions 2416-2420 in Figure 4K. 

Positions 2474-2478 in Figure 4K, 
1 5 Position 2687 in Figure 4K, 

Position 2719 in Figure 4K. 

Position 2809 in Figure 4L, 

Positions 3062-2068 in Figure 4L. or 

Positions 3097-3106 in Figure 4L, 

20 

and further with the proviso that the probe coniprising such subsequence is capable of forming 
detectable hybrids witii a target sequence of said nnycobacterial 23S rRNA. 
and a mixture of such probes. 

25 12. Peptide nucleic add probe according to any one of claims 1 to 6 for detecting a target 
sequence of 16S rRNA of one or more mycobacteria other than mycobacteria of the 
fi^ycdbacterium tuberculosis Complex (I^OTT) optionally present in a sample, which probe 
comprises from 10 to 30 polymerised moieties of formula (I) as defined in claim 6, 

30 with the proviso that the Qs of adjacent moieties are selected so as to torn a sequence of 
which a subsequence Includes at least one nucteobase that is complementary to a 
nucleobase of M. avium 16S rRNA differing from tiie corresponding nucleobase of at least M. 
tuberculosis located within the following domains 



35 Positions 1 35-1 36 in Figure 5A, 
Positions 472-475 in Figure 6A, 
Positions 1136-1144 in Figure 5A, 
Positions 1287-1292.in Figure SB. 
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Position 1313 in Figure 5B, or 
Position 1334 in Figure SB, 

and further wtth the proviso that the probe comprising such subsequence is capable of farming 
detectable hybrids with a target sequence of said mycobacterial 16S rRNA, 
and a mixture of such probes. 

13. Peptide nucleic acid probe according to any one of claims 1 to 6, 1 1 and 12 for detecting a 
target sequence of 23S or 16S rRNA of one or more mycobacteria other than mycobacteria of 
the Mycobacterium tuberculosis Complex (MOTT) optionally present in a sample, which probe 
comprises from 10 to 30 polymerised moieties of formula (I) as defined in claim 6, 

witii the proviso that the Qs of adjacent moieties are selected so as to form a sequence of 
which a subsequence includes at least one nucleobase that is complementary to a 
nucleobase of M. avium 23S or Ids rRNA differing from the corresponding nucleobase of at 
least M. tuberculosis located within ttie following domains 

Positions 99-101 In Figure 4A, 
Positions 290-293 in Figure 48, 
Positions 400-405 in Figure 48 and Figure AC, 
Positions 453-462 in Figure 4C, 
Positions 637-660 in Figure 4D, 
Positions 763-789 in Figure 4E, 
Positions 1311-1327 in Figure 4F, 
Positions 1361-1364 In Figure 4G, 
Positions 1 734-1 741 in Rgure 4H. 
Positions 2025-2027 in Figure 41, 
Positions 2474-2478 in Figure 4K. 
Positions 3062-2068 in Figure 4L. or 
Positions 1287-1292 in Figure 58. 

and further with ttie proviso that the probe comprising such subsequence is capable of forming 
detectable hybrids with a target sequence of said mycobacterial 23S or 16S rRNA, 
and a mixture of such probes. 

14. Peptide nucleic acid probe according to any one of claims 1 to 6 for detecting a target 
sequence of 23S. 16S or 5S rRNA of one or more mycobacteria of tiie Mycobacterium 
tuberculosis Complex (MTC) or for detecting a target sequence of 23S, 16S or 5S rRNA of 
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one or more mycobacteria other than mycobacteria of the Mycobacterium tuberculosis 
Complex (MdTT) optionally present in a sample, which probe comprises from 10 to 30 
pdlymeriseid moieties of formula (I) as defined in daim 6, 

5 with tiie proviso that the Qs of adjacent moieties are selected so as to torm a sequence of 
which a subsequence includes at least one nucleobase that is complementary to a 
nucleobase that differs from the corresponding nucleobase of 23S, 16S or 5S rRNA of said 
one or more mycobacteria located within the following domains 

10 positions 2568-2569 in Figure 6, 

Position 452 in Figure 7, 
Positions 473-477 in Figure 7, or 
Positions 8B5-iB66 in Figure 7, 

.15 • 

and further with the proviso that the probe comprising such subsequence is capable of fonming 
detectable hybrids with the target sequence of said mycobacterial 23S, 16S or 5S rRNA, 
and a mixture of such probes. 

Sid 15. Peptide nucleic acid probe according to any one of claims 1 to 14 of formula (11), (III), or 



25 



30 



35 




wherein Z. R^. R^ and R*. and Q is as defined in claim 6 with the provisos defined in claims 6 
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to 14. 

and a mixture of such probes. 

16. Peptide nucleic acid probe according to any one of claims 1 to 15, wherein Z is NH, NCH3 
5 or O, each R^ and independently designate H or the side chain of a natufally occuning 

amino acid, the side chain of a non-naturally occurring amino acid, or all<yl> arid each Q Is 
a naturally occurring nucieobase or a non-naturally occurring nucleobase with the provisos 
defined in claims 6 to 14, 
and a mixture of such prdbes. 

10 

17. Peptide nucleic add probe according to any one of claims 1 to 16, wherein Z is NH or O, 
and R^ is H or the side chain of Ala, Asp, Cys, Glu, His, HomoCys. Lys, Orn, Ser or Thr, and 
Q is a nucleobase selected from thymine, adenine, cytosine, guanine, uracil, iso-C and 2.6- 
diaminopurine with the provisos defined in claims 6 to 14, 

15 - and a mixture of such probes. 

1 8. Peptide nucleic acid probe according to any one of claims 1 to 17 of formula (V) 



Q 




wherein R^ is H or the side chain of Ala. Asp, Cys, Glu, His, HomoCys, Lys, Om, Ser or Thr, 
and Q is as defined in claim 17 with the provisos defined in claims 6 to 14, 
25 and a mixture of such probes. 

19. Peptide nucleic acid probe according to any one of cbiriis 1 to 18 further comprising one 
or more labels and a mbcture of such probes, which labels may be mutually identical or 
different, which probes optionally rnay comprise one or more linkers, and which probes may 

30 be mutually identical or different with the provisos defined in claims 6 to 14. 

20. Peptide nucleic acid probe according to any one of claims 1 to 19 for detecting a target 
sequence of one or more mycobacteria, the nucleobase sequence of said probe being 
substantially complemehtaiy to the nucleobase sequence of said target sequence. 

35 

21. Peptide nucleic acid probe according to any one of claims 1 to 20 for detecting a target 
sequence of one or more mycobacteria, the nucleobase sequence of said probe being 
complementary to the nucleobase sequence of said target sequence. 



W09g/lS648 



65 



PCT/DK97/Q0425 



22. Peptide nucleic add probes according to any one of claims 1 to 21. wherein the Qs of 
adjacent moieties are selected so as to form the following subsequences 

5 AGA TGC GGG TAG CAC (selected from posliions 149-158 in Figure 1 A), (Seq ID no 1) 

TGT TTT CTC CTC CTA (selected from positions 220-221 in Figure 1 A), (Seq ID no 2) 
ACT GCC TCT CAG CCG (selected from positions 328-361 in 

Figure 1A and Figure IB). ' (Seq ID no 3) 

TGA TAG TAG GCA GGT (selected from positions 453-455 In Figure 1 B). (Seq ID no 4) 

10 GGG ATT CAC AGCGGA (selected from positions 490-501 InFigurelB). (SeqlDnb5) 

TCA CCA CCC TCC TCC (selected from positions 637^60 in Figure 1C). (Seq ID no 6) 

TTA AGO TTG CGA CAT (selected from positions 706-712 In Figure 1C), (Seq ID no 7) 

ACT ATT CAC ACQ CGC (selected from positions 762-789 In Figure 1 D), (Seq ID no 8) 

CTC CGC GGT GAA CCA (selected from position 989 in Figure 1 D), (Seq ID no 9) 

15 GCT TTA CAC CAC GGC (selected from positions 1068-1072 In Figure ID). (Seq ID no 10) 

ACG C7T GGG GGC CTT (selected from position 1 148 In Figure 1 E), (Seq ID no 1 1) 

CCA CAC CCA CCA CAA (selected from positions 131 1-1329 In Figure IE). (Seq ID no 12) 

CCG GTG GCT TCG CTG (selected from positions 1361-1364 In Figure IF). (Seq ID no 13) 

ACT TGC CTT GTC GCT (selected from position 1418 In Figure 1 F), (Seq ID no 14) 

20 GAT TCG TCA CGG GCG (iselected from positions 1563-1570 In Figure IF). (Seq ID no 15) 

AAC TCC ACA CCC CCG (selected from positions 1627-1638 in Figure 1 G). (Seq ID no 1 6) 

ACT CCA CAC CCC CGA (selected from positions 1627-1638 in Figure 1G). (Seq ID no 17) 

ACC CCT TCG CTT GAC (selected from positions 1675-1677 In Figure 1G). (Seq ID no 18) 

CTT GCC CCA GTG TTA (selected from position 1 71 8 In Figure 1 G). (Seq ID no 19) 

26 CTC TCC CTA CCG GCT (selected from positions 1734-1740 In Figure 1 H), (Seq ID no 20) 

GAT ATT CCG GTC CCC (selected from positions 1967-1976 In Figure 1H). (Seq ID no 21) 

ACT CCG CCC CAA CTG (selected from positions 2403-2420 In Figure 1 H). (Seq ID no 22) 

CTG TCC CTA AAC CCG (selected from positions 2457-2488 in Figure 1 1), (Seq ID no 23) 

TTC GAG GTT AGA TGC (selected from positions 2457-2488 in Figure 1 1), (Seq ID no 24) 

30 GTC CCT AAA CCC GAT (selected from positions 2457-2488 In Figure 1 1). (Seq ID no 25) 

GGT GCA CCA GAG GTT (selected from positions 2952-2956 In Figure 1 1). (Seq ID no 26) 

CTG GCG GGA CAA CTG (selected from positions 2966-2969 In Figure 1J). (Seq ID no 27) 

TTA TCC TGA CCG AAC (selected from positions 3000-3003 In Figure 1J). (Seq ID no 28) 

GAC CTA TTG AAC CCG (selected from positions 3097^1 06 in Figure 1J). (Seq ID no 29) 

35 

GAA GAG ACC TTT CCG (selected from positions 76-79 In Figure 2A), (Seq ID no 30) 

CAC TCG AGT ATC TCC (selected from positions 98-101 In Figure 2A). (Seq ID no 31) 

ATC ACC CAC GTG TTA (selected from positions 136-136 In Figure 2A), (Seq ID no 32) 

GCA TCC CGT GGT CCT (selected from positions 194-201 In Figure 2B), (Seq ID no 33) 

40 CAC AAG ACA TGC ATC (selected from positions 194-201 in Figure 2B), (Seq ID no 34) 

TAA AGC GCT TTC CAC (selected from positions 222-229 In Figure 2B), (Seq ID no 35) 

GCT CAT CCC ACA CCG (selected from position 242 in Figure 2B), (Seq ID no 36) 
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CCG AGA GAA GCC GGA (selected from position 474 In Figure 2C). (Seq ID no 37) 

AGT CCC CAC CAT TAG (selected from positions 1 1 36-1 145 in Figure 20), (Seq ID no 38) 

AAC CTC GCG GGA tCG (selected from positions 1271-1 272 in Figure 2C), (Seq ID no 39) 

GGC TTT TAA GGA TTC (selected from positions 1287-1292 In Figure 2D), (Seq ID no 40) 

5 GAG CCC GAT CCG AAC (selected from position 1313 in Figure 2D). (Seq ID no 41) 

CCG ACT TCA CGG GGT (selected from position 1334 in Figure 2D), (Seq ID no 42) 

CGG AGG GGC AGT ATC (selected from positions 8&-90 in Figure 3). (Seq ID no 43) 

iO GAT CAA TGC TCG GTT (selected from positions 99-101 in Figure 4A), (Seq ID no 44) 

TTC CCC GCG TTA CCT (selected from position 183 In Figure 4A). (Seq ID no 45) 

TTA GCC TGT TCC GGT (selected from positions 261-271 in Figure 4A). (Seq ID no 48) 

GCA TGC GGT TTA GCC (selected from positions 281-284 in Figure 4B). (Seq ID no 47) 

TAG CCG GTT GTC CAT (selected from positions 290-293 in Figure 4B). (Seq ID no 48) 

1 5 Gt A GAG CTG AGA CAT (selected from positions 327-335 and 

343-357 In Figure 4B). (Seq |d no 49) 
GCC GTC CCA GGC CAC (selected from positions 400-405 in 

Figure 4B and Figure 4C), (Seq |D no 50) 

CTC GGG TGT TGA TAT (selected from positions 453-462 in Figure 4C). (Seq ID no 61 ) 

20 ACT ATT TCA CTC CCT (selected from positions 587-699 in Figure 4C). (Seq ID no 52) 

ACG CCA TCA CCC CAC (selected from positions 637-660 In Figure 4D). (Seq ID no 53) 

CGA CGT GTC CCT GAC (selected from positions 704-712 in Figure 4D), (Seq ID no 54) 

ACT ACA CCC CAA AGG (selected from positions 763-789 in Figure 4E). (Seq ID no 55) 

CAC GCT TTT ACA CCA (selected from positions 1060-1074 in Figure 4E), (Seq ID no 56) 

25 GCG ACT ACA CAT CCT (selected from positions 1 177-1 185 In Figure 4E), (Seq ID no 57) 

CGG CGC ATA ATC ACT (selected from positions 1259-1265 in Figure 4F), (Seq ID no 58) 

CCA CAT CCA CCG TAA (selected from positions 131 1-1327 in Figure 4F). (Seq ID no 59) 

CGC TGA ATG GGG GAC (selected from positions 1345-1348 in Figure 4F). (Seq ID no 60) 

GGA GCT TCG CTG AAT (selected from positions 1361-1364 in Figure 4G), (Seq ID no 61) 

30 CGG TCA CCC GGA GCT (selected from positions 1361-1364 in Figure 4G), (Seq ID no 62) 

GGA CGC CCA TAC ACG (selected from positions 1556-1570 in Figure 4G). (Seq ID no 63) 

GAA GGG GAA TGG TCG (selected from posftions 1608-1613 in Figure 4H), (Seq ID no 64) 

AAT CGC CAC GCC CCC (selected from positions 1626-1638 in Figure 4H), (Seq ID no 65) 

CAG CGA AGG TCC CAC (selected from positions 1651-1659 in Figure 4H). (Seq ID no 66) 

35 GTC ACC CCA TTG CTT (selected from positions 1675-1677 in Figure 4H), (Seq ID no 67) 

ATC GCT CTC TAC GGG (selected from positions 1734-1741 in Figure 4H). (Seq ID no 68) 

GTG TAT GTG CTC GCT (selected from positions 1847-1853 In Figure 41). (Seq ID no 69) 

ACG GTA TTC CGG GCC (selected from positions 1967-1976 In Figure 41), (Seq ID no 70) 

GGC CGA ATC CCG CTC (selected from positions 2006-2010 in Figure 41), (Seq ID no 71) 

40 AAA CAG TCG CTA CCC (selected from positions 2025-2027 in Figure 40. (Seq ID no 72) 

CCT TAC GGG TTA ACG (selected from positions 2131-2132 in Figure 4J), (Seq ID no 73) 

GAG ACA GTT GGG AAG (selected from positions 2252-2255 in Figure 4J). (Seq ID no 74) 
TGG CGT CTG TGC TTC (selected from positions 2396-2405 in 
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Figure 4J and Figure 4K). (3eq ygj 

CGA CTC CAC ACA AAC (selected from positions 2416-2420 m Figure 4K). (Seq ID no 76) 

GAT AAG GGT TCG ACG (selected from positions 2474-2478 in Figure 4K). (Seq ID no 77) 

ATC CGT TGA GTG ACA (selected from position 2687 In Figure 4K), (Seq ID no 78) 

CAG CCC GTT ATC CCC (selected from position 271 9 in Figure 4K), (Seq ID no 79) 

AAC CTT TGG GAC CTG (selected from position 2809 In Figure 4L), (Seq ID no 80) 

TAA AAG GGT GAG AAA (selected from positions 3062-3068 in Figure 4L), (Seq ID no 81) 

GTC TGG OCT ATC AAT (selected from positions 3097-3108 In Figure 4L), (Seq ID no 82) 

AGA TTG CCC ACG TGT (selected from positions 1 35-1 36 in Figure 5A), (Seq ID no 83) 

AAT CCG AGA AAA CCC (selected from positions 472-475 in Figure 5A). (Seq ID no 84) 

GCA TTA CCC GCT GGC (selected from positions 1 1 36-1 144 in Figure 5B). (Seq ID no 85) 

TTA AAA GGA TTC GCT (selected from positions 1287-1292 In Figure 6B), (Seq ID no 86) 

AGA CCC CAA TCC GAA (selected from position 1313 in Figure 5B), (Seq ID no 87) 

GAC TCC GAC TTC ATG (selected from position 1334 in Figure 5B), (Seq ID no 88) 

GTC TTT TCG TCC TGC (selected from positions 2568-2569 in Figure 6). (Seq ID no 89) 

GTC TTA TCG TCC TGC (selected from positions 2568 in Figure 6). (Seq ID no 90) 

GTC TTC TCG TCC TGC (selected from positions 2568 in Figure 6). (Seq iD no 91) 

GTC TTG TCG TCC TGC (selected from positions 2568 in Figure 6). (Seq ID no 92) 

GTC TAT TCG TCC TGC (selected from positions 2568 in Figure 6). (Seq ID no 93) 

GTC TCT TCG TCC TGC (selected from positions 2568 in Figure 6). (Seq ID no 94) 

GTC TGT TCG TCC TGC (selected from positions 2568 in Figure 6), (Seq ID no 95) 

TTG GCC GGT GCT TCT (selected from positions 452 in Rgure 7). (Seq ID no 96) 

TTG GCC GGT ACT TCT (selected from positions 452 in Figure 7). (Seq ID no 97) 

TTG GCC GGT CCT TCT (selected from positions 452 in Figure 7). (Seq ID no 98) 

TTG GCC GGT TCT TCT (selected from positions 452 in Figure 7). (Seq ID no 99) 

ACC GCG GCT GCT GGC (selected from positions 473-477 In Figure 7). (Seq ID no 100) 

ACC GCG GCT ACT GGC (selected from positions 473 in Figure 7), (Seq ID no 101) 

ACC GCG GCT CCT GGC (selected from positions 473 in Figure 7). or (Seq ID no 102) 

ACC GCG GCT TCT GGC (selected from positions 473 in Figure 7). (Seq ID no 103) 

CGG CAG CTG GCA CGT (selected from positions 474 in Figure 7), (Seq ID no 104) 

CGG CCG CTG GCA CGT (selected from positions 474 in Figure 7), (Seq ID no 105) 

CGG CTG CTG GCA CGT (selected from positions 474 in Figure 7), (Seq ID no 106) 

CGT ATT ACC GCA GCT (selected from positions 477 in Figure 7), (Seq ID no 107) 

CGT ATT ACC GCC OCT (selected from positions 477 in Figure 7), (Seq ID no 107) 

CGT ATT ACC GCT GCT (selected from positions 477 In Figure 7). (Seq ID no 109) 

TTC CTT TGA GTT TTA (selected from positions 865-866 in Figure 7). (Seq ID no 1 1 0) 

TTC CTT TAA GTT TTA (selected from positions 865 in Figure 7). (Seq ID no 1 1 1) 

TTC CTT TCA GTT TTA (selected from positions 865 In Figure 7). (Seq ID no 1 12) 

TTC CTT TTA GTT TTA (selected from positions 865 in Figure 7), (Seq ID no 1 13) 

TTC CTT AGA GTT TTA (selected from positions 866 in Rgure 7), (Seq ID no 1 14) 
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TTC CTT CGA GTT TTA (selected from positions 866 In Figure 7). (Seq ID no 1 1 5) 

TTC CTT GGA GTT TTA (selected from positions 866 In Figure 7). (Seq ID no 1 16) 

CAT GTG TCC TGT GGT (Seq ID no 1 17) 

CGT CAG CCC GAG AAA (Seq ID no 1 18) 

CAC TAG ACA CGC TCG (Seq ID no 1 1 9) 

TGG CGT TGA GGT TTC and (Seq ID no 120) 

AAC ACT CCC TTT GGA (Seq ID no 123) 

and a mixture of such probes. 

23. Peptide nucleic acid probes according to claim 22. wherein the Qs of adjacent moieties 
are selected so as to form the following subsequences 

f CA CCA CCC TCC TCC (Seq ID no 6) 
15 CCA CCC TCC TCC (modified Seq ID no 6) 

ACT ATT CAC ACG CGC (Seq ID no 8) 

CCA CAC CCA CCA CAA (Seq ID no 12) 

AAC TCC ACA CCC CCG (Seq ID no 16) 

ACT CCA CAC CCC CGA (Seq ID no 17) 

20 ACT CCG CCC CAA CTG (Seq ID no 22) 

CTG TCC CTA AAC CCG (Seq ID no 23) 

TTC GAG GTT AGA TGC (Seq ID no 24) 

GTC CCT AAA CCC GAT (Seq ID no 25) 

GACCTATTGAACCCG (Seq ID no 29) 

25 

GCA TCC CGT GGT CCT (Seq ID no 33) 

CACAAGACATGCATC (Seq ID no 34) 

GGC TTT TAA (3GA tTC (Seq ID no 40) 

30 GAT CAA TGC TCG GTT (Seq ID no 44) 

CGA CTC CAC ACA AAC (Seq ID no 76) 

GCA TTA CCC GCT GGC (Seq ID no 85) 

35 GTC TTA TCG TCC TGC (Seq ID no 90) 

GTC TTC TCG TCC TGC (Seq ID no 91) 

GTC TTG TCG TCC TGC (Seq ID no 92) 

GTC TAT TCG TCC TGC (Seq ID no 93) 

GTC TCT TCG TCC TGC (Seq ID no 94) 

40 GTC TGT TCG TCC TGC (Seq ID no 95) 



AAC ACT CCC TTT GGA 



(Seq ID no 123) 
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CATGtGtCCtGTGGT 
CGTCAGCCCGAGAAA 



(Seq ID no 117) 
(Seq ID no 118) 



5 CACTACACACGCTCG. 
TGG CGTTGAGGTTTC 



(Seq ID no 119) 
(Seq ID no 120) 



and a mixture of such probes. 



10 24. Peptide nucleic acid prot>es according to claim 22 or 23 selected from 



15 



20 



25 



30 



35 



Lys(Flu)"Lys(Flu)-TCA CCA GCC TCC TCC-NH^ 
Lys(Flu)-Lys(Flu)-CCA CCC TCC TCC-NHj 
Lys(Flu)-Lys(Flu)-ACT ATT CAC ACG CGC-NHa 
Lys(Flu)-ACT ATT CAC ACG CGC-NH2 
Lys(Flu)-Lys(Flu)-CCA CAC CCA CCA CAA-NH2 
Lys(Flu)-Ly8(Flu)-AAC TCC ACA CiCC CCG-NHj 
Lys(Flu)-Lys(FIu)^CT CCA CAC CCC CGArNHj 
Lys(Flu)-Lys(Flu)-ACT CCG CCC CAA CTG-NHj 
Lys(Flu)-Lys(Flu)-CTG TCC CTA AAC CCG-NHj 
Lys(Flu)-Lys(Flu)-TTC GAG GTT AGA TGC-NHz 
Lys(FJu)-TTC GAG GTT AGA TGC-NHj 
Lys(Flu)-Lys(Flu)-GtC CCT AAA CCC GAT-NHj 
Lys(Flu)-GTC CCT AAA CCC (SAT-NHj 
Lys(Ru)-GAC CTA TTG AAC CCG-NHj 

Lys(FJu)-Lys(Flu)-Giy-GCATCC CGT GGT CCT-NHj 
Lys(Flu)-Lys(Flu)-CAC AAG ACA TGC ATC-NHj 
^ Lys(Flu)-CACAAGACATGCATC>NH2 
Lys(Flu)-GGC TU TAA GGA TTC-NHj 
Lys(Rho)-GGC TTT TAA GGA TTC-NHj 

Flu-p-AIa-p-Ala-GAT CAA TGC TCG GTT-NH2 
Flu-3-Ala-p-Ala-CGA CTC CAC AGA AAC-NHa 

Flu-P^AIa-p-Ala-GCA TTA CCC GCT GGC-NH, 



(OK 446Miodlfled Seq ID no 6) 
(OK 575/modlfied Seq ID no 6) 
(OK 447/modified Seq ID no 8) 
(OK 688/mod(fied Seq ID no 8) 
(OK 448/modlfied Seq ID no 12) 
(OK 44gi/modlfied Seq ID no 16) 
(OK 309/modlfied Seq ID no 17) 
(OK 45a/mbdffied Seq ID no 22) 
(OK 305/modified Seq ID no 23) 
(OK 306/modlfled Seq ID no 24) 
(OK 682/modified Seq ID no 24) 
(OK 307/modifled Seq ID no 25) 
(OK 654/niodified Seq ID no 25) 
(OK 66Q/modified Seq ID no 29) 

(OK 223/modified Seq ID no 33) 
(OK 310/modified Seq ID no 34) 
(OK 655/modlfied Seq ID no 34) 
(OK 689/niodified Seq ID no 40) 
(OK 7Q2/modifted Seq ID no 40) 

(OK 624/modified Seq ID no 44) 
(OK 612/modified Seq ID no 76) 

(OK 623Miodified Seq ID no 85) 



Lys(Flu)-GTC TTT TCG TCC TGC-NHj 
Lys(Rho)-GTC TTA TCG TCC TGC-NHa 
40 Lys(Rho)-GTC TTC TCG TCC TGC-NHj 
Lys(Rho)-GTC TTG TCG TCC TGC-NHj 
Lys(Rho)-GTC TAT TCG TCC TGO-NHa 



(OK 746/modified Seq ID no 89) 
(OK 746/modified Seq ID no 90) 
(OK 746/modified Seq !D no 91) 
(OK 746/modified Seq ID no 92) 
(OK 747/modtried Seq ID no 93) 
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Lys(Rho)-GTC tCT TCG TCC TGC-NH2 
Ly8(Rho)^TC TGT TCG TCC TGC-NHj 



(OK 747/modlfied Seq ID no 94) 
(OK 747/moclified Seq ID no 95) 



Ly8(Flu)^C ACT CCC TTT GGA-NHj 



(OK 748/modified Seq ID no 123) 



wherein Flu denotes a 5-(and 6)-carboxyfluoroescein label and Rho denotes a rtiodamine 
label 

and a mixture of such probes. 

25. Use of a peptide nucleic acid probe according to any one of claims 1 to 24 or a mixture 
thereof for detecting a target sequence of one or more mycobacteria optionally present in a 
sample. 

26. Use Of a peptide nucleic acid probe or a mixture thereof according to claim 25 for detecting 
a target sequence of one or more mycobacteria of the Mycobacterium tuberculosis Complex 
(MTC), In particular a target sequence of M. tuberculosis. 

27. Use of a peptide nucleic acid probe or a mixture thereof according to claims 25 for 
detecting a target sequence of one or more mycobacteria other than mycobacteria of the 
Mycobacterium tuberculosis Complex, in particular a target sequence of one or more 
rnycobacteria of the Mycobacterium avium Confiplex. 

28. Method for detecting a target sequence of one or more mycobacteria optionally present in 
a sample comprising 

(1) contacting any rRWi or rDNA present in said sample with one or more peptide 
nucleic acid prdbes according to any one of claims 1 to 24 or a mbcture thereof under 
conditions, whereby hybridisation takes place between said probe(s) and said rRNA 
or rDNA, and 

(2) observing or measuring any formed detectable hybrids, and relating said 
observation or measurement to the presence of a target sequence of one or more 
mycobacteria in said sample. 

29. Method according to daim 28 for detecting a target sequence of one or more 
mycobacteria of the Mycobacterium tuberculosis Complex (MTC), in particular a target 
sequence of M. tuberculosis. 

30. Method according to claim 28 for detecting a target sequence of one or more 
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mycobacteria other than mycobacteria of the Mycobacterium tuberculosis Complex. 

31 . Method according to any one of claims 28 to 30. wherein the hybridisation takes place in 
situ. 

5 

32. Method according to any of of claims 28 to 30, wherein the hybridisation takes place in 
vitro. 

33. A method according to any one of claims 28 to 32, 

10 c h a r a c t e r I s e d in that a signal amplitying system is used for measuring the resulting 
hybridisation. 

34. Method according to any one of claims 28 to 33, wherein the sample is a sputum sample. 

16 35. Kit for detecting a target sequence of one or rhore mycobacteria, in particular a target 

sequence of one or more mycobacteria of the Mycobacterium tuberculosis Complex (MTC). in 
partfcular a target sequence of M. tuberculosis, and/or for detecting a target sequence of one 
or rriore mycobacteria other than mycobacteria of the Mycobacterium tuberculosis Complex 
(MOTT), In particular a target sequence of one or more nr^ycobacteria of the Mycobacterium 

20 avium Complex, 

characterised In that said kit comprises at least one peptide nucleic acid probe 
according to any one of claims 1 to 24, and optionally a detection system with at least one 
detecting reagent. 

25 36. Kit according to claim 35, 

characterised in that it further comprises a solid phase capture system. 
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130 



140 



150 



160 



GGGGAAACCCAGCACGAGTGATGTCGT OCTACCCGCATb T M 



1093 

422 ggggHracccagcaggagtgatgtcgtg 

422 GGGG 3 AACCCAGCACGAGTGATGTCGTG 

507 gggg@aacccBgcacgagtgatgtcgtg ri 

432 GGGGAAACCCAgCACGAG'ljavgGTCGTG 
207 GGGGAAACCCAGCACGAGTgpVTGTCGTG 
150 GGGGAAACCCAGCACGAGTGATGTCGTG 
2588 GGGGAAACCC^CACGAGTGATGTCGTGlr^CCR Ggcg C' 



ACCCGIpVTCT M 
'ACCCGjlMCT M 

iItacccgIiStct M, 

TCT M, 

M, 

CC|R|G|3C(3CT M. 



.tuberculosis 
avium 

.paratuberc. 
phlei 
leprae 
gastri 
kansasii 
smegmatis 



210 220 230 240 

1172 CATCTCAGTACCCGTAGGTififlAGAAAACAATTGTGATTCC M 



501 CATCTCAGTACCCGTAGGAG 

501 CATCTCAGTACCCGTAGGAG 

586 CATCTCAGTACCCGTAG^i 

511 CATCTCAGTACCCGTAGGAG 

286 CATCTCAGTACCCGTAGGAG 

229 CATCTCAGTACCCGTAGGAG 

2667 CATCTCAGtQcCCGTAGG 



lGAAAACAATTGTGATTCC m 
lGAAAACAATTGTGATTCC m 
lGAAAACAATTGTGATTCC M 
lGAATUE^CAATTGTGATTCC M. 

lGaaaacaaESgtgattcc m. 
tAaaacaaMgtgattcc m. 
iagaaaacarhtgtgattcc m. 



tuberculosis 
avium 

paratuberc. 

phlei 

leprae 

gastri 

kansasii 

smegmatis 



1289 

617 

617 

703 

629 

404 

347 

2785 



330 



340 



350 



360 



TGTGGGJls-GATATfiTnTnnRnffnTftrnnr.GCTr,ap;n-(^rt m nH^^r^m r.^A ^ 
fATATGTCTCAGCfflCTACC^ " 



TGTGGG 
TGTGGG 



3 ATATGTCTCAGC I CTAC 



TGTGGG|CCTgTgrGTCgCA5jc^CCgcCGG CfeA 
TGTGGGA|rijGgTATGTCTCAy(±CTACCgGG^G _ 
TGTGGGA TC GATA0GTCTCAGC I CTACCCGGCTGAC 
TGTGGGA rcjsATA : GTCTCAGC I CTACCCGGCTGAC^ 
TGTGGGA|c^ATp^TCQc^CgCTACC@GGCTG3G3 



^gctg; 

rGCTGAG 



-GG M- avium 
;GG M- paratuberc. 
_ M. phlei 
-GG M. leprae 
■GG M. gastri 
■GG M.Jcansasii 
JG M. smegmatis 



Figure 1A 
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370 380 390 400 

1^27 i^GTCAGAAAGTGTCGTGGTTAGCGGAAGTGGCCTGGGAT M. tuberculosis 

M- avium 
M-paratuberc. 



656 mTVGTCAGAAAGTGTCGTGGTTAGCGGAAGTGGCCTGGG; 

656 I AGTCAGAAAGTGTCGTGGTTAGCGGAAGTGGCCTGGGi .... « 

742 I AGTgAflRAAGlCAGllGTGGTTAGggGAAGTGGCCTGGGAT M.phlei 

668 [ijAGTCAGAAAGTGyCGTGGTTAGCGGTU^gTGGCCTGGGAT M. leprae 

443 cagtcagaaagtgtcgtggttaHcggaagtggcctgggat M.gastri 

386 CAGTCAGATUVGTGTCGTGGTTAyCGGAAGTGGCCTGGGAT M.kansaaii 

2823 CAGT|AGAAAgrGT@GTGGTTAGCGGAA@TGGC@rGGGAT M.smegmatis 



450 



460 



470 



480 



735 
820 
747 
522 
465 



1406 CGGCACCTGCCI^pVTCAATTCCCGAGTAGCAGCGGGCC M. tuberculosis 
735 CGGCACCTGCCTT^ATCAAlcacCCGA^^^ M. avium 

CGGCACCTGCCTb ^ATCAA| cfl|ccCGAGTAGCAGCGGGCC M . pa r at ube rc . 
^G|T|ccgGCgCTCACA6G^ M.phlei 
ijGGCACCTGCClteTATCAATTCCCGAGTAGCAGCGGGCC M. leprae 
CGGCACCTGCCT I GTATCAATTCCCGAGTAGCAGCGGGCC M . g ast ri 
CGGCACCTGCCT 1 GTATCAATTCCCGAGTAGCAGCGGGCC M . leans asi i 



2902 CGgc^CTGgCTgjG^GGT^TTCCCGAGTAGCAGCGGGCC M.smegmatis 



490 500 510 520 

1446 CGTGGAATdcGCTGTGART^nrfianiirrivrnrQCT^ vr n^^^^ 

775 CGTGGAATCffGCTGTGAATCfiGCCGGGACCACCCGGTAAG M. avium 
775 CGTGGAATCIGCTGTGAATCIGCCGGGACCACCCGGTAAG M-paratuberc . 
857 CGTGGAATCIGCTGTGAATCIGCCGGGACCACCCGGTAAG M.phlei 
787 CGTGGAATCIGCTGTGAATCIGCCGGGACCACCCGGTAAG M. leprae 
562 CGTGGAATCIGCTGTGAATCIGCCGGGACCACCCGGTAAG M.gastri 
505 CGTGGAATGIGCTGTGAATCIGCCGGGACCACCCGGTAAG M. kansasii 
2942 CGTGGAATqgGCTGTGAATCyGCCGGGACCACCCGGTAAG M.smegmatis 



Figure 1B 
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610 



620 



630 



640 



1566 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCc trcCT M, tubergnl n^i c 
894 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCCTCCT M avium 
894 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCCTCCT M.paratuberc 
976 GTACCTGAAACCGTGTGCCTACAATCCGTCA§AGCc||cT M.phlei 

907 gtacctgaaaccgtgtgcctacaatccgtcagagcctciIt M. leprae 

682 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCCiaciT M^gastri 
625 GTACCTGAAACCGTGTGCCTAGAATCCGTCAGAGCC^riT M.Jcansasii 
3062 GTACCTGAAACCGtG|GCgTACAATCCGTCAGAGCc|c2j^ M. smegmatis 



650 660 670 680 

1606 TTTCCTCTcnf^f^nr.r.i^nar^^r=rp^T.rr^^o^^^^^^^^^^^ M. tuberculosis 

|C [GfflGaSGTGATGGCGTGCCTTTTGA M. avium 

fez ^—HGTGSGGTGATGGCGTGCCTTTTGA M.paratuberc. 

1016 |CrT| ^gl^GIGSGGTGATGGCGTGCCTTTTGA M.phlei 

947 T ^GIGSGGTGATGGCGTGCgTTTTGA M. leprae 

722 Tf— GIGSGGTGATGGCGTGCCTTTTGA M.gastri 

665 |G{l|G|GjGGTGATGGCGTGCCTTTTGA M.kansasii 



CGTGR Ff- 



-pgGlgGGTGATGGCGTGCCTTTTGA M. smegmatis 



690 700 710 720 

1646 AGAATGAGCCTGCGAGTCAGGGAC aTGTCGCA^ GGTTAan M. 
AGAATGAGCCTGCGAGTCAGGGACATGTCGCAAGGTTAAC m! 

agaatgagcctgcgagtcagggacaBgtcgcHaggttaac m! 
agaatgagcctgcgagtcagggaca c gtcgc 3 aggttaac m ! 
agaatgagcctgcgagtcagggacaggtcgc s aggttaac m ! 

1046 AGAATGAGCCTGCGAGTCAGGGACATGTCGCGAGGTraAC m! 
972 AGAATGAGCCTGCGAGTCAGGGACATGTCGC 3 AGGTTAAC m! 
747 AGAATGAGCCTGCGAGTCAGGGACATGTCGC 3 AGGTTAAC M. 
690 AGAATGAGCCTGCGAGTCAGGGACATGTCGC 3 AGGTTAAC m! 
3132 AGAATGAGCCTGCGAGTCAGGGACATGTCGC 3 AGGTTAAC m! 



4 
959 
23 
959 



tuberculosis 
bo vis 
avium 

intracellular 

paratuberc. 

phlei 

leprae 

gastri 

kansasii 

smegmatis 



Figure 1C 
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1726 
84 

1039 

103 

1039 

1126 

1052 

827 

770 

3212 



770 



^ 780 790 800 

qGACCCACACGCGCRTACGCGCGTGTGAfll TAGTGGCGTGT M. 



CG ACCCA CACGCGCATACGCGCGTGTGAATAGTGGCGTGT M 
CG 




rlG^GlGT RGTGGCGTGT M 

IIGT AGTGGCGTGT M 

|CA^CjdCTlj3rGte|3(giGT[^ AGTGGCGTGT M, 

CG iAlgc^MAq^ OTTGGOG TTG GTG^ AGTGG@GTGT M . 

CG TAT — jCAC^jG TGAC^G ljG|l GTp- AGTGGCGTGT M . 

CGTAT-^-CACGCGg^GCGIGIGT AGTGGCGTGT M. 

CGTAT — ^CGCGC 3AGCg|i|g[i|gT AGTGGCGTGT M. 

CGfTAT — Iq CACACAt RlslA isTG.TGTG^ M. 



tuberculosis 

bovis 

avium 

intracellular 

paratuberc. 

phlei 

leprae 

gastri 

kansasii 

smegmatis 



970 



980 



990 



1000 



1926 

1228 

1228 

1322 

1244 

1019 

962 

3408 



ATTTAGGTGCAGCGTTGCGTGGTTCACCjStGGAGGTAGAG M. tuberculosis 
ATTTAGGTGCAGCGTTGCGTGGTTCACC0CGGAGGTAGAG M. avium 
ATTTAGGTGCAGCGTTGCGTGGTTCACC^CGGAGGTAGAG M. paratuberc - 
ATTTAGGTGCAGCGTgGCgrGHTTCfTTAT CGGAGGTAGAG M . phle i 
ATTTAGGTGCAGCGTTGCGTGGTTCAC^RCGGAGGTAGAG M. leprae 
ATTTAGGTGCAGCGTTGCGTGjijTTCACCRCGGAGGTAGAG M.gastri 
ATTTAGGTGCAGCGTTGCGTG I TTCACch CGGAGGTAGAG M. Jcansasii 
ATTTAGGTGCAGCGlgGCgrGllfrT ClrTGcl cGGAGGTAGAG M. smegmatis 



1050 



1060 



2005 
1307 
1307 
1401 
1323 
1098 
1041 
3486 



1070 



1080 



CAGCCAAACTCCGAATGCCG-TGGTG HTA-AAp CGTGGC!n M, 

cagccaaactccgmtgccg-tggtg-t^Kagcgtggca M , 
cagcca7tf\ctccgaatgccg~t ggtg -tap ragcgtggca m. 

TGflAAG@GTGGCA M. 



TARAAGCGTGGCA M, 



cagccaaactccgaatgccgmt fia g" 
cagccaaactccgaatgccg-tgg ^ 
cagccaaactccgaatgccg-tggtg-ta)i{a[-|gcgtggca M 

CAGCCAAACTCCGAATGCCG-TGGTG-TA Ig JgCGTGGCA M 
CAGCCAAACTCCGAATGCCGBTraG|c^Ji^G@GgGG@^ M 



tuberculosis 

avium 

paratuberc, 
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smegmatis 



Figure 1D 
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2082 
1385 
1385 
1479 
1401 
1175 
1118 
3566 



1130 1140 1150 1160 

ACAGCCCAGATCGCCGGCTAAGGCCCCyVAGCGTGTGCTA M. tuberculosis 
ACAGCCCAGATCGCCGGCTAAGGCCCC I RAGCGTGTGCTA M. avium 
ACAGCCCAGATCGCCGGCTAAG6CCCC I AAGCGTGTGCTA M, par at ube rc . 
ACAGCCCAGATCGCCGGCTAAGGCCCC I AAGCGTGTGCTA M . phle i 
ACAGCCCAGATCGCCGGCTAAGGCCCC I AAGCGTGTGCTA M. leprae 
ACAGCCCAGATCGCCGGCTAAGGCCCC R AAGCGTGTGCTA M - g as t ri 
ACAGCCCAGATCGCCGGCTAAGGCCCC R AAGCGTGTGCTA M- kansasii 
ACAGCCCAGATCGCCGG@TAAGGCCCC0AAGCGT@rG|TA M. smegmatis 



2241 
1544 
1544 
1638 
1560 
1334 
1277 
3726 



2280 
1583 
1583 
1676 
1600 
1367 
1310 
3764 



1290 1300 1310 1320 

ctctiagcacaccgccgaagccgcggcaca tIccaccttgt- m 
ctc?\agcacaccgccgaagccgcggcaca' 
ctcaagcacaccgccgaagccgcggc acai 

CTCAAGCACACCGCCGAAGCCGCGGCflE^ojCAycgrfllT^ 

ctcaagcacaccgccgaagccgcggcaca" 
ctcaagcacaccgccgaagccgci 
ctcaagcacaccgccgaagccgcl , 
@tcaagcagaccgccgaagccgcg' 



caictt-tr 
:atctt-tr 



:accttctr 
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1330 



1340 



1350 



1360 



"GGTGGGT^GGGTAGGfiRAflrf:TrrrTrBTiTir'B^r'riy^7^q M, tuberculosis 
TGTGGGTAGGGGAGCGTCCcScATTCAGCGAAG M. avium 
TGTGGGTAGGGGAGCGTCCCpCATTCAGCGAAG M. paratuberc . 

gCATgc|G@GAAG M. phlei 



CGGTi 

cggtggL 

XGG^GGTGTGGGTAGGGGAGCGTC^>.^*j^j^^y«««« i-i. ^uxc: j. 
GGGTGG^;GTGGGTAGGGGAGCGT|cCTCATTCAGCGAAG M, leprae 
AGGtJ— --^TGGGTAGGGGAGCGTCCCTCATTCAGCGAAG M. gastri 

r|gGT| TGGGTAGGGGAGCGTCCCTCATTCAGCGAAG M. leans as ii 

FT -F rGGGTAGGGGAGCGTCcgggATgcgGgsnixfl M. smegmatis 
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1370 



1380 



1390 



1400 



2319 
1623 
1623 
1716 
1640 
1402 
1345 
3796 



JGGGTGACCGGTGGTGGAGGGTGGGGGAGTGAGAAT M. tuberculosis 

C&=|CCGGGTGACCGGTGGTGGAGGGTGGGGGAGTGAGAAT M. avium 
Cg-CCGGGTGABCGGTGGTGGAGGGTGGGGGAGTGAGAAT M. paratuberc . 
CgSCCG§GTGAgjcGGTGGTGGAG6GTGgGGGAGTGAGAAT M.phlei 
CCTCCGGGT@ACCGGTGGTGGAGGGTGGGG^GTGAGAAT M- leprae 

^ccgggtgaccggtggtggaggHtgggggagtgagaat M. gastri 

OTGCCGGGTGACCGGTG6TGGAGGHTGGGGGAGTGAGAAT M. kansasii 
GqgcCG@GT^|c^TGGTGGAGGGTG|6GGAG^ M. smegmatis 



2359 
1662 
1662 
1756 
1680 
1442 
1385 
3836 



1410 



1420 



1430 



1440 



GGAGGCATGAGTAGCGJ 



3\AGGCAAGTGAGAACCTTGCCC M. tuberculosis 
VTTO3XfiRr:ixaf;'rr:iiftTinr'r«rp'Ti/=r"r'r^ M- avium 

M. paratuberc. 



GCAGGCATGAGTAGCGA I RAGGCAAGTGA6AACCTTGCCC 
GCAGGCATGAGTAGCGA I AAGGCAAGTGAGAACCTTGCCC 



|ccc M.phlei 



wx^^wGCATGAGTAGCGA I RAGGCAAGTGAGAACCTTguuu 
GCAGGCATGAGTAGCGA I RAGGCAAGTGAGAACCTTGCCC M. leprae 
GCAGGCATGAGTAGCGA I AAGGCAAGT6AGAACCTTGCCC M • g ast r i 
GCAGGCATGAGTAGCGA I^GGCAAGTGAGAACCTTGCCC M. kansasii 
GCAGGCATGAGTAGCGA^rgAGGCAAGTGAGAACCTTgcCC M . smegmat i s 



1570 



1580 



1590 



1600 



2519 
1821 
1821 
1915 
1840 
1602 
1545 
3996 



eg CCCGTGACGAATCA-GCGGTArTanrnnrrrnTiTiT^ r»r.A M. tuberculosis 
CGICCCTGAIGAATCA-GCGGTACTAACCACCCAAAACCG M- avium 
CG I CC C TGA I GAATC A-GCGG TACTAACCACCCAAAACCG M. paratuberc . 
CGyCCyTGA I GAATC trCATTd lfiCTAACCACCCAAAACCg M.phlei 
CGCCCGTGA I GAATCA--GCGGTACT@ACCACCCAAAACCG M . le p r ae 
CGCCCGTGA I GAATCA-GCGGTACTAACCACCCAAAACCG M - g a s t r i 
CGCCCGTGA 1 GAATCA-GCGGTACTAACCACCCAAAACCG M-kansasii 
CG|ccgrGflyGAATCA-GCGGTACTAACCfl|cCAAAACC@ M. smegmatis 
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1610 



2558 
1860 
1860 
1955 
1879 
1641 
1584 
4035 



1620 



1630 



1640 



GAT- CGATCAC- TCCCCTTCGGGGG-| TG TGGft 



GAT-- CGA C CA 1 -TCCCCTTCGGGGG GTCGHgI 
GAT~CGAgCflJl|-TCCCCTTCGG GTGG C G 

Gl^CGATCp- WCCQtTCGGGgE J: GTG^ G 
GAT- CGA@C%gTCCCCTTCGGGGG ^ §TGGAG 
GAT-CGATCAC-TCCCCTTCGGGGG A- GTGGAG 
GAT-C GATCAC-T CCCC TTCGGGGG cJgTGGAgI-, 
jftCCGOl GABcgc^cfr^TCGGGGpf^GTGGgGT' 



GTTC-1 ?G M 




-TGG M 
, C-TGG M, 

't||tgg M, 
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1650 



1660 



1670 



2594 
1896 
1896 
1986 
1917 
1677 
1620 
4071 



1680 



ggctgcgtgggaacttcgctggtagtagtcaagc Eaa& gg M. tuberculosis 
ggctgcgtgggaBcttcgctggtagtagtcaagctKE GGG M . av i urn 

GGCTGCGTGGGA C CTTCGCTGGTAGTAGTCAAGCyA I GGG M . par atube rc - 
GGCTGCGTGGGAjgC^G^G^^GTAGTAGTCAAGCGA I GGG M . phle i 
GGCTGCGTGGGAACTTCGpjTGGTAGTAGTCAAGCGATGGG M. leprae 



GGCTGCGTGG ^GC CTTCGCTGGTAGTAGTCAAGCGA I GGG M. gastri 
GGCTGCGTGGfiGC CTTCGCTGGTAGTAGTCAAGCGA I GGG M. kansasii 
GGCTGC@TGGGngCTTCG@TGGTAGTAGTCAAGCGAgGG M. smegmatis 



1690 1700 1710 1720 

2634 -GTGACGCAGGAAGGTAGCCGTACCAGTCAGTGGTA^OX- M. tuberculosis 
1936 -GTGACGCAGGAAGG^GCCGTACCAGTCAGTGGTAAnfe- M. avium 
1936 -GTGACGCAGGAASGgAGCCGTACCAGTCAGTGGTAAIA- M. paratuberc . 
2025 -GTGACGCAGGAAGGTAGCCGTACCAGTCAGTGGTAAIA- M.phlei 
1957 -GTGACGCAGGAAGGTAGCCGTACCAGTCAGTGGTAAIA- M. leprae 
1717 - GTGACGCAGGAAGG0AGCCGTACCAGTCA6TGGTAA I A- M . g as t r i 
1660 -GTGACGCAGGAAGGgAGCCGTACCAGTCAGTGGTARIR- M.kansasii 
4111 -GTGACGCAGGAAGGTAGCCGTACcgGTCAGTGGTAA^- M. smegmatis 



Figure 1G 
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1730 



1740 



1750 



1760 



2672 ^^CTGGGGCAAGC^fiiag^^ M. tuberculosis 

1974 -CTGGGGCAAGCcSgTAgF^GAGCGATAGGCAAATCCGT M. avium 
1974 -CTGGGGCAAGCCCGTAGt^'^GAGCGATAGGCAAATCCGT M.paratuberc . 
2063 -C§GGGG|AAgcCIGTAGGG§GAG|GATAGGCA7U\TCCGT M-phlei 
1995 "CTGG@GCAAGCCCGTAGGGAGAGCGATAGGCAAATCCGT M. leprae 
1755 -CTGGGGCAAGCCRGTAGGGAGAGCGATAG6CAAATCCGT M.gastri 
1698 -CTGGGGCAAGCCftGTAGGGAGAGCGATAGGCAAATCCGT M.kansasii 
4149 -cgGGgG@RAGCCyGTAGGGAG6^ATAGG§AAATCCGT M.smegmatis 



2908 
2208 
2208 
2298 
2231 
1910 
1934 
4385 



1970 1980 1990 2000 

AGGGG(iACCGGT\j:^TA^aTCiT^nnTir.rr.'P^^^^ M. tuberculosis 

agggggSccggaataBcgtgaacacccttgcggtgggagc m. avium 

AGGGGGlgCC GGAA TAgCGTGAACACCCTTGCGGTGGGAGC M.paratuberc - 
AGGGGGACC 30AC(} rAnCGTGA|GG^CTTGCGGgsG^GC M. phlei 
AGGGGGgcCGGAATATCGTGAACACCCTTGCGGTGGGAGC M- leprae 

M.gastri 

agggggaccggaataBcgtgaacacccttgcggtgggagc M. kansasii 
agggggacc^^t^cgtg|r^c@tt@cgg^c^gc M.smegmatis 



2410 



2420 



2430 



2440 



3345 

284 

2645 

393 

2645 

2737 

2668 

1910 

2372 

4822 



AC ICTCGACGCCAGTTGGGGCt SGAGTCGTTGTTGAAATarr! M, 
A CCTCG ACGCCAGTTGGGGCGGAGTC6TTGTTGAAATACC M , 
^ACAlGACGCCAGTTfflGfflGfflGGAGTCGTTGTTGAAATACC M. 
A rAC;^ GACGCCAGTT I G TAl GGAGTCGTTGTTGAAATACC M . 
^^lCT^ GACGCCAGTT I GgGll GGAGTCGTTGTTGAAATACC M . 
.x^gGACGCCAGTTgGGG I GGAGTCGTTGTTGAT^TACC M , 
]^GACGC@AGTTGGGGyGGAGTGGTTGTTGAAATACC M, 

AC CTCP ACGCCAGTTGGGGf 
Hc trCACI ACGCCAGa^GGGl] 



SGAGTCGTTGTTGAAATACC M 
jGAGTCGTTGTTGAAATACC M 



AC 



tuberculosis 

bovis 

avium 

intracellulare 

paratuberc. 

phlei 

leprae 

gastri 

Jcansasii 

smegmatis 



Figure 1H 



wo 98/15648 



9/31 



PCT/DK97/00425 



2450 



3385 

324 

2685 

433 

2685 

2777 

2708 

1910 

2412 

4862 



2460 2470 2480 

actctgatcgtattg qggatctarcctcgarccctg aatc m . 
actctgatcgtattgggcatctaacctcgaaccctg aatc m , 
actctgatcgtattggEIcaBctaacStcgaaccct^^ M. 
actctgatcgtattgg ft ca c ctaac 3 tcgaaccct - t atc m , 

ACTCTGATCGTATTGGyCA gCTAACyTCGAACCCTha ATC M , 
ACTCTGATCGTATTGGGCgTCTAACCTCG§ACCflTa^ M, 
ACTCTGATgGTATTG^CATCTAACCTCGAAC^T^^TC 

ACTCTGATCGTATTGG§Ci^TAAC|TCGAACCCTGAATC M, 

actctgatcgtattgggc^ctaacctcg|acc|t^tc M. 
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3425 

364 

2724 

472 

2724 

2817 

2748 

1910 

2452 

4902 



2490 



2500 



2510 



2520 



GGGTTTAGb GACAGTGCCTGGCGGGTAGTTTAACTffRRffr! M. 
GGGTTTAGGGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M . 

gggtt^Sggacagtgcctggcgggtagtttaactggggc m . 

GGGTT C A : GGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M . 
GGGTT C AgGGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M . 
SGGTTjgAGGGACAGTGCCTGGgGGGTAGTTTAACTGGGGC M . 
gGGTTTAGGGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M . 

M. 

GGGTTCAgGGACAGTGCCTGGCGGGTAGTTTAACTGGGGC M. 
gGGTTgAGGG ACAGTGCCTGGgGGGTAGTTTAACTGGGGC M , 
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2930 2940 2950 2960 

3864 AGTACGAGAGGACCGGGACGGACGAACCTC TfaGTGgE c^ trnhprrinl ncj-i 
3163 AGTACGAGAGGACCGGGACGGACGAACCTCTGGmmCCA M. avium 
3163 AGTACGAGAGGACCGGGACGGACGAACCTCTGGTMACCA M-paratuberc . 
3256 AGTACGAGAGGACCGGGACGGACGAACCTCTGGT Al ACCA M. phle i 
3187 AGTACGAGAGGACCGGGACGGACGAACCTCTGGT^gACCA M. leprae 
^^^^ M. gastri 

2891 agtacgagaggaccgggacggacgaacctctQgtgcacca M.kansasii 
5342 AGTACGAGAGGACCGGGACGGACGTU^CCTCTGGT^ACCA M. smegmatis 



Figure 1i 
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2970 



2980 



2990 



3000 




3904 
3203 

3203 GTTGTCC^CCAGGGSCACyGCTGGATAGCCACGTTCGGU M. 
3296 GTTGTCCCf^CCAGGGGCACCGCTGGATAGCCACGTTCGGR M. 
3227 GTTGTCgcycCAGGGGCACCGCTGGATAGCCACGTTCGGA M 

^ 

2931 GTTGTCCOTCCAGGGGCACCGCTGGATAGCgACGTTCGGH M, 
5382 GTTGTCCCHCCAGGGGCACgGCTGGATAGCCACGTTCGGy M, 
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3010 



3020 



3030 



3040 



3944 CA^ATAACCGCTGAAAGCATCTAAGCGGGAAACCTTCTC M. tuberculosis 
3243 CAGGATAACCGCTGAAAGCATCTAAGCGGGAAACCTTCTC M. avium 
3243 CAGGATAACCGCTGAAAGCATCTAA6CGGGAAACCTTCTC M- paratuberc . 
3336 CAGGATAACCGCTGAAAGCATCTAAGCGGGAAACCT^^C M- phlei 
3267 CA@GATAACCGCTGAAAGCATCTAAGCGGGJUVACGTTCTC M. leprae 
^^^^ M. gastri 

2971 CAGGATAACCGCTGAAAGCATCTAAGCGGGAAACCTTCTC M.kansasii 
5422 CAGGATAACCGCTGAAAGCATCTAAGCGGGAAACCT^C M. smegmatis 



4023 

609 

3322 

677 

3322 

3415 

3309 

1910 

3050 

5501 



3090 3100 3110 3120 

AGAACACGG qTTCAATAGGH a^GACCTGGAASCT M. 
AGAACACGGGTTCAATAGGTCAGACCTGGAAGCT M, 

a ■ '^'^gIataggScagacctggaagct M. 

tagg:cagacctggaagct m. 
ii^taggjccagacctggaagct m, 

TAGg^CAGACCTGgngGC^ M, 

M. 

^ ■ M. 
AGAACACGGGTTd^TAGGBCAGACCTGGAAGCT M. 
AGA@CACGGG@TEgATAG^gCAGACCTGGAAGC@ M. 



CCCGC- 
CCCGC- 
CCCGC- 
GCCGC- 
CCCGC- 
CCCGC- 



CCGiSC- 
CCCGC- 



•aga:cacggi 
'AGa:cacgggtt< 
-agaicacggg 
aga:cacggg 
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50 



60 



70 



80 



2 GCGGCGTGCTTjVRCACATGCAAGTCGAACGGAAR dGTga : M. 

.141 gcggcgtgcttaacacatgcaagtcgjvacggaaaggtctc m 
39 gcggcgt^ ttaacacatgcargtcgaacggaaaggBctc M 
1 EzEEEEEz3ttaacacatgcaagi@gaacggaaagR^ m 

39 

2 
40 
2 
2 

40 
1 



GCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGd:: CTC M 
GCGGCGTGCTTAACA@ATGCAAGTCGAACGGAAAGGg^C M. 
GCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGTCTC M . 
0CGGGGTGCTTAACACATGCAA6TCGAACGGAAAGGTCTC M, 
GCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGTCTC M, 
GCGGCGTGCTTAACACATGCAAGTCGAACGGgAAGGggrC M. 
I i GTGCTTAACACATGCjU^TCGAACGGAAAGGTCTC M, 
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42 
181 
79 
32 
.79 
42 
80 
41 
42 
80 
36 



90 100 110 120 

TCG(^BOT|\CTCGAGTGGCGAACGGGT M. 

TCGGAGATACTCGAGTGGCGAACGGGT M. 



T 

T 

T TCGGAGATACTCGAGTGGCGAACGGGT M. 

T — ^ TCGGgG-TACTCGAGTGGCGAACGGGT M. 

T TCGGAG3TACTCGAGTGGCGAACGGGT M. 

T --' — . ^^ tcggIgIghtactcgagtggcgaacgggt M. 

T |ftAAAAATCTTTll T ^ GAG ATACTCGAGTGG CGAACGGGT M. 



T 

T 



- TCGGAGfl OACTCGAGTGGCGAACGGGT M . 
- TCG GAGA qACTCGAGTGGCGAACGGGT M, 

- I | Gfe|grAC^CGAGTGGCGAACGGGT M. 

-TCGGAGATACTCGA@TGGCGAACGGGT M. 
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70 

209 

107 

59 

107 

70 

120 

69 

70 

104 

64 



130 140 150 160 

GAGTAACACGTGG(fr'GkTCTGCCCTSCAr:TTr!--aar:tiiTn^ 
GAGTAACACGTGGGTGATCTGCCCT6CACTTC- GGGATAA M . 
GAGTMCACGTGGgIcSaTCTG CCCTGCACt TC- GGGATAA M , 
6AGTAACACGTGGG Z7\ ATCTGCCCTGCACTTC- GGGATAA M . 
GAGTAACACGTGGGC;^ ATCT§CCCTGCACTTC- GGGATAA M, 
GAGTAACACGTGGGbai AtCTGCCCTGCACTTC-GGGATAA M , 
GAGTAACACGTGGGTOATCTGCCCTGCACTTC@GGGATAA M, 
GAGTAACACGTGGGCfl ATCTGCCCTGCAcE3c-GGGAT7U\ M , 
GAGTAACACGTGGGCTi RTCTGCCCTGCAC ftOC-GGGATAA M . 
GAGTAACACGTGGGTOATCTGCCCTGCACHTC-GGGATAA M. 
GAGTAACACGTGGGgGATCTGCCCTGCACTTC-GGGATAA M . 
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170 



180 



190 



200 



109 

248 

146 

98 

146 

109 

160 

108 

109 

143 

103 



GCCTGGGAAACTGGGTCTAATACCGGATAGGAC fcAdbfeiblft^ M , 

gcctgggaaactgggtctaataccggataggaccacggga m 
gcctgggaaactgggtctaataccggataggaccH^Sga M, 
gcctggga7\actgggtctaataccggataggacc ittfl gg m . 

GCCTGGGAAACTGGGTCTAATACCGGATAGGACC^jcbu^GA M, 
GCCTGGGAAACTGGGTCTAATACCGGATAGGACCAq gB M , 
GC@rGGGAAACTGGGTCTAATACCGGATAGGACg|cW\GG M. 
GCCTGGGA7\ACTGGGTCTAATACCGGATAGGACCAC n g s m . 
GCCTGGGAAACTGGGTCTAATACCGGATAGGACCAC rTOg M . 
GCCTGGGAAACTGGGTCTAATACCG^TAGGACCACyGGA M, 
GCCTGGGATVACTGGGTCTAATACCGGATAGGACCACGGGA M . 
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149 
288 
186 
138 
186 
149 
200 
148 
149 
183 
143 



210 220 230 240 

BjCATGTCTTGTGGTGGAAAG fcGCTTTAGC GGTGTGGGRT M. 
TGCATGTCTTGTGGTGGAAAGCGCTTTAGCGGTGTGGGAT M . 



::gcatgtct' 

CGCATGTCTTtr. 
CGCATGTCT 



|TGGTGGAAAGCp5TTTraCGGTGTGGGAT M 
IGGTGGAAAGC - - TTT ifeCGGTGTGGGAT M 
_ GGTGGAAAGC-irTT-GCGGTGlgGgAT M, 
t GCATGgCTTGTGGTGGAAAGC - - TTT I GCGGTGTGGGAT M , 
C GCATGTCTTGTGGTGGA7\AGC -T TTT I GCGGTG^fljGGAT M . 
C GCATgBcTTGTGGTGGAAAGC ~ TTT I GCGGTGTGGGAT M . 
cbcATGjgCTTGTGGTGGAAAGC - - TTT I GCGGTGTGGGAT M . 
C^CATGTC0T§TGGTGGAAAGd-mTT - GCGGTGTGGGAT M . 
ATGTCjcjTGTGGTGGAAAGp-^CTTT^CGGTGTGGGAT M, 
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250 260 270 280 

189 ^scccgcggcctatcagcttgttggtggggtgacggcct m, 

328 gagcccgcggcctatcagcttgttggtggggtgacggcct m, 

224 gBgcccgcggcctatcagcttgttggtggggtgacggcct m! 

176 GGGCCCGCGGCCTATCAGCTTGTTGGTGGGGTGAgGGCCT m! 

224 G3GCCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M, 

187 G3GCCCGCGGCCTATCAGCT0GTTGGTGGGGTGflgGGCCT m! 

239 G3GCCCGCGGCCTATCAGCTe^T1§GTGGGGT^CGGCCT M, 

.186 G 3 GCCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M . 

187 GsbcCCGCGGCCTATCAGCTTGTTGGTGGGGTGACGGCCT M. 

221 GSHCCCGCGGCCTATCAGCTTGTTGGTGGGGTGAgGGCCT M. 

" GgGCCCGCGGCCTATCAGCTTGTTGGTGGGGTl^CGGCCT M. 



181 
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389 
528 
424 
376 
424 
387 
439 
386 
387 
420 
381 



450 460 470 480 

AAACCTCTTTCACCATCGACGAAGGTCCGGGTlglrCTCGG M. 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTTCTCTCGG M . 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTTfflrCTCGG M 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTT I TCTCGG 
AAACCTCTTTCACCATCGACGAA6GTCCGGGTT qTCTRbG 
AAACCTCTTTCACCATCGACGAAGGlSliCtV C 



AAACCTCTTTCACCATCGACGAAGGTCMGGG fiAlt rCTCGG 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTTCTCTCGG M 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTTCTCTCGG M 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTtfflrCTCGG M 
AAACCTCTtTCACCATCGACGAAGGT|C66GTl^CTCGG M 
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1130 1140 1150 1160 

1069 TCTCATGTTGCCAGC RCGTAk^ddTfa GGGACTCGTf:;af;nf? m, 
1208 TCTCATGtTGCCAGCACGTAATGGTGGGGACTCGTGAGAG M< 
1104 TCTCATGTTGCCAGCBaGTAATGlcaGGGGACTCGTGAGAG M. 
1056 TCTCATGTTGCa\GcbdGTAATGCCGGGGACTCGTGAGAG M, 
1098 TCTCATGTTGCCAGCpdGTAATGOTGGGGACTCGTGAGAG M. 
1064 TCTCATGTTGCCAGqGGjGTAATGlcgGGGGACTCGTGAGAG M. 
1119 TCTCATGTTGCCAGCACGT7UVTGGTGGGGACTCGTGAGAG M. 

1066 TCTCATGTTGCCAGcEaGTAATGlcaGGGGACTCGTGAGAG M. 

1067 TCTCATGTTGCCAGCSGGTAATGCCGGGGACTCGTGAGAG M. 
1100 TCTCATGTTGCCAGCggGTAATGpqGGGGACTCGTGAGAG M, 
1061 TCTCATGTTGCCAGCACGTAATGGTGGGGACTCGTGAGAG M. 
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1250 



1260 



1270 



1280 



1189 
1328 
1224 
1176 
1218 
1184 
1239 
1186 
1187 
1220 
1181 



CAATGGCCGGTACAAAGGGCTGCGATGCC(yCGkGGTTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCGCGAGGTTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCG^AGGtTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCGCTi AGGTTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCGraAGGTTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCGCh AGGTTAAG M 
CAATGGCCGGTACAT^GGGCTGCGATGCCGC^GGTTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCGCGAGGTTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCGCGAGGTTAAG M 
CAATGGCCGGTACAAAGGGCTGCGATGCCGCGAGGTTAAG M 
CJU^TGGCCGGTACAAAGGGCTGCGATGCCGCGAGGTTAAG M 
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1290 1300 1310 1320 

1229 CGAATqC32H^^2|UlGCCGGTCTCAGTTC6GATE|5GGGTCT M, 
1368 CGAATCCTTA-AAA6CCGGTCTCAGTTCGGATCGGGGTCT M. 
1264 CGAATCCTTfi^^AAAGCCGGgCTCAGTTCGGAljljGGGGTCT M, 
1216 CGAATCCTTriAAAGCCGGTCTCAGTTCGGATISGGGTCT M, 
1258 CGAATCCTTriAAAGCCGGgCTCAGTTCGGATIGGGGTCT M, 
1224 CGAATCCTTriAAAGCCGGTCTCAGTTCGGATCGGGGTCT M. 
1279 CGAATCCTTriAAAGCCGGTCTCAGTTCGGATCGGGGTCT M. 

1226 CGAAtCCTTrTAAAGCCGGTCTCAGTTCGGATCGGGGTCT M. 

1227 CGAATCCTTrTAAAGCCGGTCTCAGTTCGGATCGGGGTCT M. 
1260 CGAATCCTTrTjAAAGCCGGTCTCAGTTCGGATCGGGGTCT M. 
1221 CGAATCCTTgf-AAAGCCGGtCTCAGTTCGGATCGGGGTCT M- 
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1330 1340 1350 1360 



1268 GCAACTCGACCCCfGlrGAAGTCGGAGTCGCTAGTAATCGCA tuberculosis 
1407 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M.bovis 

1304 gcaactcgaccccHtgaagtcggagtcgctagtaatcgca M. avium 

1256 GCAACTCGACCCqRTGMGTCGGAGTCGCTAGTAATCGCA M. intracellulars 
1298 GCAACT@GACCC^TGAAGTCGGAGTCGCTAGTAATCGCA M-paratuberc . 
1264 GCTUVCTCGACCCCGTGAAGTCGGAGtCGCTAGTAATCGCA M. scrofulaceum 
1319 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M. leprae 

1266 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M-lcansasii 

1267 GCTIACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M.gastri 
1300 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATC6CA M- gordonae 
1260 GCAACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCA M. marinum 
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50 



60 



70 



80 



128 TTCCGAACCCGGAAGCTAAGCCTGCCAGCGCCGATGATAC M. tuberculosis 
39 T S CCGTtflCCCGGAMCTAAGCCTGCCAGCGCCGATGATAC M . bovi s 
41 TSCCGAACCCGGAAGCTAAGCCTGCCAGCGCC^TGATAC M.phlei 
3559 TRCCGAACCCGGAAGCTAAGCCTGgCAGCGCCGATGATAC M. leprae 
5743 TgCCGAACCCGGAAGCTAAGCCTGCCAGCBcCGAtGATAC M.smegmatis 



90 100 110 120 

TGCC(|£a;S£g|3G TGGAAAAGTAGGACACCGCCGAAC M. tuberculosis 

79 TGCCCCTCCGGgS TGGAAAAGTAGGgCACCGCCGAAC M.bovis 

81 TGCCCffcacgGG|Gj TGGAAAAGTAGGACACCGCCGAAC M.phlei 

3599 TGCCCrSiCGGG TGGAAAAGTAGGACACQGCCGTy^C M. leprae 

5782 Tgccqiptlc^ TGGATUkAGTAGGACAGCGCCGAAC M.smegmatis 
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382 
382 
1053 
467 
392 
167 
110 



90 



100 



110 



120 



GGGAGCTGTCAACCGAGqaiapATCCGASGATTTCCGAAT M. avium 
GGGAGCTGTCAACCGAGCATTGATCCGAGGATTTCCGAAT M. parat ube rc . 
GGGAGCTGTC7\ACCGAGdiT@GAtCCGAGGATTTCCGAAT M. tuberculosis 
GGGAGCTGTCAACCGAGC S T 3 GATCCGAGGATTTCCGAAT M . phle i 
GGGAGCtGTCAACCGAGCSrSGATCCGAGGATTTCCGAAT M. leprae 
GGGAGCTGTCAACCGAGC 3 T 3 GATCCGAGGATTTCCGAAT M. gast ri 
GGGAGCTGTCAACCGAGC 3 IgGATCCGAGGATTTCCGAAT M- kansasii 



2548 GGGAGCTGTCAACCGAGCgTTGATCCGAGGATlTCCGJUVT M.smegmatis 



170 



462 GAATATATAGGGTGCG- 
462 GAATATATAGGGTGCG- 
1133 GAATATATAG GGTGC G- 
547 GAATATATAGGlCGT'llG- 
472 GAATATATAGGGryCG- 

247 g;\atatatagggtgcg- 
190 gaatatatagggtgcg- 

2628 GAATATATAGgEg^cH- 



180 190 200 

"ggag(|3\acgcggggaagtgaaa m, 
•g6aggtaacgcggggaagtgaaa m. 
•ggaggSaacgcggggaagtgaaa m. 

•GG§GG3AACGCGGGGAAGTGAAA M. 
■GGAGGST^CGCGGGGAAGTGAAA M, 
■GGAGG 3 AACGCGGGGAAGTGAAA M. 
■GGAGG3AACGCGGGGAAGTGAAA M, 
•GGgGGgAACGCGG@GAAGTGAAA M. 
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541 

1212 

626 

551 

326 

269 



250 260 270 280 

-GTCAGTAGTGGCGAGCGA Ac-CGGAACA-Gb nTRnarrr; 

M. avium 

-GTCAGTAGTGGCGAGCGAAC-CGGAACA-GGCTAAACCG M. paratuberc . 
-G^AGTAGTGGCGAGCGAA^CGGAACA-GGCTAAACCG M. tuberculosis 
-GpkGTAGTGGCGAGCGAAEA^ M . phle i 

■"G^jlAGTAGTGGCGAGCGAAaSTOGA M. leprae 

-GTCAGTAGTGGCGAGCGAAC sICGGAACA I GGCTAAACCG M , g ast r i 
-GTHaGTAGTGGCGAGCGAAC 3 CGGAACA I GGCTAAACCG M. kansasii 



2706 @GTyAGtAGTGGCGAGCGAAcBcGGfl^iyGGCTAAAC0G m! smegmatis 
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290 300 310 320 

!AT^CATC^2a2a^CCGGGTAGGGGTTGTGTGTGCGGGGT M, 
CATG-CATGGACAACCGGGTAGGGGTTGTGTGTGCGGGGT M . 
-CATGG^lkACCGGGTAGGGGTTGTGTGTGCGGGGT M. 
G-CATG^'GA^CcBGG^gGGGGTTGTGTGTGCGC 

rA|GGTAGGGGTTGTGTGTGCGGt_ 
CCGGGTAGGGGTTGTGTGTGCGGGGT M. 
CCGGGTAGGGGTTGtGTGTGCGGGGT M. 
.CCGGGTAGGGGTTGTGTGTGCGGGGT M. 




CACAf-CATCg-C 
CAdG--CATGGbTi 
CAlqG-CATGGGf 

gNcat gFga' 
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617 
617 
1289 
703 
629 
404 
347 



330 



340 



350 

JL 



360 



TGTGG dATTGATATGl rCTCAGC lTCTACCTGGCTGAGG^ GG M. 

tgtgggattgatatgtctcagctctacctggctgagg-gg m , 



TGTGGGAp-|GMATGTCTCAGdC ^ 
TGTGGGls^CCT^BTGTCflCAS qSTCCGf C 




JGCTGAG] 
teGd^MlGGl 



GG M 

[g M 

'GAGG-GG M 



TGTGGGATTGgTATGTCTCAgCTCTACCTGi 

tgtggga-^gataBgtctcagctctaccBggctgagg-gg m 
— tgtgggatogatazgt m 

2785 TGTSGGA^^l^gTC^ggCTCTACCTGGCTGBG^GG M 
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370 380 390 400 

tagtcagaaagtgtcgtggttagcggaagtggcctggg;^ M. avium 

TAGTCAGAAAGTGTCGTGGTTAGCGGAAGTGGCCTGGGAC M . p ar at ube rc - 
1327 @AGTCAGAAAGTGTCGTGGTTAGCGG7U^GTGGCCTGGGAS m. tuberculosis 
742 TAGTgAgAAAG^^GTGGTTAGg^GAAGTGGCCTGGGAI M. phlei 
TAGTCAGAAAGTg1^GTGGTTAGCGGAA@TG6CCTGGGAI M. leprae 

3agtcagaaagtgtcgtggttaHcggaagtggcctgggai M - gastri 

C AGTCAGAAAGTGTCGTGGTTAyCGGAAGf GGCCTGGGA I M • leans as i i 



656 
656 



668 
443 
386 



2823 |gAGTgAGAAA@TGT:@6TGGTTAGCGGAA@TGGCgTGGGA I 



M. smegmatis 
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410 420 430 

696 GGCCCpcCGTAGACGGTGAGAGCCCGGTACGCGAAA- 
69 6 GGCCCGCCGTAGACGGTGAGAGCCCGGTACGCGA7UV- 
1367 GGfflcfflGCCGTAGACGGTGAGAGCCCGGTACGCGAAA- 
7 82 Gg|iJc I GCCGTAGf^GGTGAGAGCCCG^CgCGAAA- 
708 GGCC I GCCGTAGACGGTGAGAGCCCgGTOCGCGTWV- 

483 ggHc I gccgtagacggtgagagcccggtacgHgaaa- 

426 GG^C i|gCCGTAGACGGTGAGAGCCCGGTACG I GAAA- 
2863 GGCG^rgCCGTAGACGGTGAGAGCCCGGTACG I GAAA- 



440 



■ACC M. 
■ACC M. 
■ACC M. 
'AC§ M. 
gcC M. 
ACC M. 
ACC M. 
ACC M. 
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450 460 470 480 

735 CGGCACCTGCCqTATATCAAC^ M. avium 

735 CGGCACCTGCCTTATATCAACACCCGAGTAGCAGCGGGCC M . par at Ube rc 
1406 CGGCACCTGCCTK ^ATCAflj acCCGAGTAGCAGCGGGCC M. tuberculosis 
mGgrgCcRGCll^C ACAGG— 'I CCCGAGTAGCAGCGGGCC M. phle i 
IgGGCACCTGCCTTSlTATCAAfrT CCCGA6TAGCAGCGGGCC M. leprae 
CGGCACCTGCCTTGTATCAATT CCCGAGTAGCAGCGGGCC M. gastri 
CGGCACCTGCCTT jTATCAfif TT CCCGAGTAGCAGCGGGCC M. kansasii 



820 
747 

522 
465 



2902 CGgc^CTGfflCTT bATGGTGTlb cCGAGTAGCAGCGGGCC M- smegmatis 



570 580 590 600 

855 GAGGGAATGGTGAAAAGTACCCCGG dftGGG-AGTGAM^ M. avium 

855 GAGGGAATGGTGAAAAGTACCCCGGGAGGG-AGTGAAATA M. paratuberc . 

1526 GAGGGAATGGTGAAAAGTACCCCGGGAGGG|aGTGAAA0A M. tuberculosis 

937 GAGGGAATgGTGAAAAGTACCCCGGGAGGG-AGTGAAftj^ M. phlei 

867 GAGGGAATGGtGAAAAGTACCCCGGGAGGG&VGTGAAATA M . leprae 

642 GAGGGAATGGTGAAAAGTACCCCGG6AGG63AGTGAA2\Ba M- gastri 

585 GAGGGAATGGTGAAAAGTACCCCGGGAGGGSAGTGAAAGA M. kansasii 

3022 gagggaatggtgaaaagtaccccgggaggg|^gtgaaa3\ m. smegmatis 
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610 



620 



630 



640 



894 GTACCTGRRACCGTGTGCCTACflATCCGTCAGAGC drCCT M- avium 

894 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCCTCCT M. paratuberc - 

1566 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCCTCCT M. tuberculosis 

976 GTACCTGA7U\CCGTGTGCCTACAATCCGTCA@AGCC§^T M. phlei 

907 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCCTC i|t M . leprae 

682 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCd^ T M. g ast ri 

62 5 GTACCTGAAACCGTGTGCCTACAATCCGTCAGAGCC CTT T M . kans as i i 

3062 GTACCTGAAACCGTGgGCgrACAATCCGTCAGAGCq^^ M-smegmatis 



934 
934 



650 



660 



670 



680 



TGATGGCGTGCCTTTTGA M. avium 



g ■ ' ^^ GTGGGGTGATGGCGTGCCTTTTGA M. paratuberc . 

1606 iTT TCCTCTCCGGAG GAlGBGEjGGTGATGGCGTGCCTTTTGA M. tuberculosis 

1016 C TH ISTAlGTGGGGTGATGGCGTGCCTTTTGA M- phlei 

947 5~ GTGGGGTGATGGCGTGC@TTTTGA M. leprae 

722 y GTGGGGTGATGGCGTGCCTTTTGA M.gastri 

665 c GTGGGGTGATGGCGTGCCTTTTGA M. Jcansasii 

3102 IftCGTGll ^GTGGGGTGATGGCGTGCCTTTTGA M.smegmatis 



690 700 710 720 

959 AGAATGAGCCTGCGAGTCAGGG JcACGTCGCGh GGTTAAC M, 
23 AGAATGAGCCTGCGAGTCAGGGACACGTCGCGAGGTTAAC M, 
959 AGAATGAGCCTGCGAGTCAGGGACACGTCGCGAGGTTAAC M. 
1646 AGAATGAGCCTGCGAGTCAGGGACAfflGTCGCHtlGGTTAAC M. 
4 AGAATGAGCCTGCGAGTCAGGGACAIGTCGC^GGTTAAC M. 
1046 AGAATGAGCCTGCGAGTCAGGGACAIGTCGCGAGGTTAAC M. 
972 AGAATGAGCCTGCGAGTCAGGGACA t GTCGCGAGGTTAAC M . 
747 AGAATGAGCCTGCGAGTCAGG6ACA I GTCGCGAGGTTAAC M, 
690 AGAATGAGCCTGCGAGTCAGGGACJV I GTCGCGAGGTTAAC M. 
3132 AGAATGAGCCTGCGAGTCAGGGACA I GTCGCGAGGTTAAC M. 
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1039 

103 

1039 

1726 

84 

1126 

1052 

827 

770 

3212 



770 

C aCRTCCCCTTTGGG- 



780 



790 



800 



GT^AGTcnnriTfST m. 

CGCATCCCCTTTGGG GTGTAGTGGCGTGT M. 

CG CAT CCc StTTGGG — ; — G TGTAGTGGCGTGT M. 

GGhCdcHcft!cGdGpATACGCGCGTGflGp^ M. 
CG RCqCp OACGgG CATACGCGCGTGT GftqTAGTGGCGTGT M . 
CG ]^CC RACCItG r j ^^ - ISGGGTTq GTGTAGTGGgGTGT M. 

cgiatcEc|g@gtgWgc^ gtgtagtggcgtgt m. 

cgiatcacacq^^gcgi gtgtagtggcgtgt m, 

CG I ATqG C 3CGqGhGCG1 GTGTAGTGGCGTGT M . 

CGmATCCry j^CA^G^^ GTGTAGTGGCGTGT M, 
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1307 
1307 
2005 
1401 
1323 
1098 
1041 
3486 



1050 1060 1070 1080 

cagccaaactccgaatgc dg-tggtg-tratxagctr tggca m, 
cagccaaactccgaatgccg-tggtg-taaaagcgtggca m. 
cagccaaactccgaatgccg- tggtg -taHaagcgtggca M. 
cagccaaactccgaatgccg§t^^=|-t^aag@gtggca m. 
cagcc7uu\ctccgaatgccg-tggtg-taaaagcgtggca m, 
cagccaaactccgaatgccg-tggtg-taejaplgcgtggca m . 
cagccaaactccgaatgccg-tggtg-taji a -gcgtggca m , 
cagccaaactccgaatgccg|t^g|^«^g@g@gg^ M, 
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1425 
1425 
2122 
1519 
1441 
1215 
1158 
3606 



1170 1180 1190 1200 

AGTGGAAAAGGATGTc feAGTCGCAGR -GAnanrrnggnrzri M. avium 
AGTGGAAAAGGATGTGTAGTCGCAGA-GACAACCAGGAGG M . par atube rc . 



AGTGGgAAAGGATGTG&^GTCGCJ 
AGTGGAAAAGGATGTGCAGTCGC ' 
AGTGGAAAAGGATGTG C AGTCQC 



^ t-GACAACCAGGAGG M. tuberculosis 
SAgGACAACCAGGAGG M, phlei 

_r .-Jr-GACAACCAGGAGG M. leprae 

AGTGGOTLAAGGATGTGZAGTCGCAGA-GACAACCAGGAGG M. gastri 
AGTGGlgAAAGGATGTG Z AGTCGCAGA-GACAACCAGGAGG M.kansasii 
AGTGGAAAAGGATGTG^GTCGCAGA@GA§AACCAGGAGG M. smegmatis 
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1250 



1260 



1270 



1280 



1504 CTCACTGGTCAAGTGATT iATGCGCCb ATAATGTAGCGGGG M- aviioitl 

1504 CTCACTSGtCAAGTGATTATGCGCCGATAATGTAGCGGGG M. paratuberc . 

2201 ctcactggtcaagtgattStgcgccgataatgtagcgggg m. tuberculosis 

1598 CTCACTGGTCAAGTGATT3TGCGC@GATAATGTAGCGGGG M.phlei 
1520 CTCACTGGTC7U\GTGATT3TGCGCCGATAATGTAGCGGGG M. leprae 
1294 GTCACTGGtCAAGTGATTSrGCGCCGATAATGTAGCGGGG M.gastri 
1237 CTCACTGGTCAAGTGATT3TGCGCCGATAATGTAGCGGGG M.kansasii 
3686 grCACTGGTGWVGTGATTjgTGCGCCGATAgTGTgGCGGGG M . smegmat i s 



1544 
1544 
2241 
1638 
1560 
1334 
1277 
3726 



1290 1300 1310 1320 

ctcaagcacaccgccgaagccgcggcacat Itcatctt-ta m, 
ctgaagcacaccgccgaagccgcggcacattcatctt" ta m , 
ctcaagcacaccgccgaagccgcggcacatBcaH^ m, 
ctgaagcacaccgccgaagccgcggcaF^ca sc^ I xg M, 
ctcaagcacaccgccgaagccgcggcacattca C CTT C TA m , 

CTCAAGCACACCGCCGAAGCCGCGfilCAj- |A C clc - =h M . 

CTCAAGCACACCGCCGAAGCCGCGg CT\ 4c\ C C 3C - - A M . 

@TCAAGCACACCGCCGAAGCCGCGGRa h~Gc| CT^ M. 
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1330 1340 1350 1360 

cggtggai tgtgggtaggggagcg' accccc attcagcgju^g m. 
cggtggatgtgggtaggggagcgtcccccattcagcgaag m , 
gtggStgtgggtaggggagcgtcc^cattcagcgaag M . 

|GGg3GgTGTGGGTAGGGGAGCGTCcFGCAT|c|G@GAAG M. 
GGTGGATGTGGGTAGGGGAGCGTgcdl CATTCAGCGAAG M , 

[GGTl ^fTGGGTAGGGGAGCGTCCC I CATTCAGCGAAG M . 

Gt[ TGGGTAGGGGAGCGTCCCl CATTCAGCGAAG M. 

fTT I TGGGTAGGGGAGCGTCCFG-lRTl^GgGAAG M. 
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1370 



1380 



1390 



1400 



1623 
1623 
2319 
1716 
1640 
1402 
1345 
3796 



Bs^^^GGGTGACCGGTGGTGGAGGGTGGGGGAGTGAGAAT M . 
CT-CCGGGTGAgCGGTGGTGGAGGGTGGGGGAGTGAGAAT M . 
^SCCGGGTGACCGGTGGTGGAGGGTGGGGGAGTGAGAAT M . 
C ZG CCG@GTGA@GGGTGGTGGAGGGTG@GGGAGtGAGAAT M , 
C Z1 CCGGGT@ACCGGTGGTGGAGGGTGGGG^GTGAGAAT M . 
CgGCCGGGTGACCGGTGGTGGAGGHTGGGGGAGTGAGAAT M. 
^3 CGGGGTGACCGGTGGTGGAGGyTGGGGGAGTGAGAAT M . 
C^gCCGgGT0C^GTGGTGGAGGGTG@GGGAGTGAGAAT M, 
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1530 



1540 



1781 
1781 
2479 
1875 
1800 
1562 
1505 
3956 



1550 



1560 



CGATGGACAACGGGTTGATATTCCCGTACCCGTGT kTGGG^ M. avium 
CGATGGACAACGGGTTGATATTCCCGTACCCGTGTATGGG M . par atube rc . 

cgatggacaacgggttgatattcccgtacccgtgt|tggg m. tuberculosis 
cgatggacaacgggttgatattcccgtacccgtgtatgHg M . phle i 

CGATGGACAACGGGTTGATATTCCCGTACCCGTGTBTGyG M. leprae 
CGATGGACAACGGGTTGATATTCCCGTACCCGTGT S TGGG M . g a s t r i 
CGATGGACAACGGGTTGATATTCCCGTACCCGTGTjGlTGGG M. kansasii 
CGATGGACAACGG6TTGATATTCCCGTACCCGTGTATG|g M. smegmatis 



1570 



1821 CGTCCCTGAro AATCA 
1821 CGTCCCTGATGAATPA 
2519 CGgccgTGAgGAATCA' 
1915 CGTCCCTGATGAATCf" 

1840 cgBccBtgatgaatca- 
1602 cgccc3tgatgaatca 

1545 CGgCCSTGATGAATCA- 
3996 CGTCCftTGATGAATCA- 



1580 



1590 



1600 

■GCGGTACTAACCACCCAAAACCG M 



avium 

GCGGTACTAACCACCCAAAACCG M. paratuberc. 
GCGGT ACTAACCACCCAAAACCG M. tuberculosis 
^T^rlCTAACCACCCAAAACCg M. phlei 
■GCGGTACT|aCCACCCAAAACCG M, leprae 
GCGGTACTAACCACCCAAAACCG M. gastri 

gcggtactaaccacccaatuvccg M.kansasii 
GCGGTACTAACCA@CCAAAACC@ M.smegmatis 



Figure 4G 
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1860 
1860 
2558 
1955 
1879 
1641 
1584 
4035 



1610 1620 1630 1640 

GAT"CG dcCAT-TC CCCTTCGGGG qC"GTGGCGATT-Cfe G M. 
GAT-CGACCAT-TCCCCTTCGGGGGC-GTGGCGATT-CGG M. 
GAT-CGA 
G^-CGA 
GAT-CGACCA': 
GAT-'CGAjljCj 
GAT-CGA I CJ 



•tccccttcgggg 
^ccQttcggg 



-TCCCCTTCGGGG 



_^ , hTCCCCTTCGGGGGC-GTGGffcaTqj M 

ACCGlj GACC pCA^ c F^^ TCGGG Gl — Ol GTGGCG I T 5GT GG M 




:GSTT|G'-bG M 



JTCCCCTTCGGGGGCraTGG^GSTT-CGG M 



stB-'Hgg M 
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1680 
J5GG M. avium 



1650 1660 1670 

GGCTGCGTGG lGACCTTCGCl rGGTAGTAGTCAAG d ^ 

GGCTGCGTGGGACCTTCGCTGGTAGTAGTCAAGCAATGGG M - parat Ube rc - 
GGCTGCGTGGGA@CTTCGCTGGTA6TAGTCAA6C0A§GGG M. tuberculosis 
GGCTGCGTGGGACC^G^^lGf^GTAGTAGTCAAGC 3 ATGGG M . phle i 



1896 
1896 
2594 
1986 

1917 _ 

1677 GGCTGCGTGGhdcCTTCGCTGGTAGTAGTCAAGciskTGGG M. gastri 

1620 

4071 



GGCTGCGTGGG3 



GGCTGCGTGGi 



•GGTAGTAGTCAAGCG ATGGG M. leprae 



!CTTCGCTGGTAGTAGTC7\AGC3ATGGG M- kansasii 



GGCTGGg|TGGGACCTTCG@TGGTAGTAGTC7lAGCyATGGG M. smegmatis 



1690 



1700 



1710 



1720 



1936 
1936 
2634 
2025 
1957 
1717 
1660 
4111 



1974 
1974 
2672 
2063 
1995 
1755 
1698 
4149 



-GTGACGCAGGAAGGCAGCCGTACCAGTCAGTGGTAATA- M. avium 
-GTGACGCSlGGAAGGCAGCCGTACCAGTCAGTGGTAATA- M. paratuberc . 
-GTGACGCAGGAAGG^GCCGTACCAGTCAGTGGTAAgA- M . tuberculosis 
-GTGACGCAGGAAGG I AGCCGTACCAGTCAGTGGTAATA- M . phlei 
-GTGACGCAGGAAGG0AGCCGTACCAGTCAGTGGTAATA- M. leprae 
-GTGACGCAGGAAGGCAGCCGTACCAGTCAGTGGTAATA- M . g as t r i 
-GTGAGGCAGGAAGGCAGCCGTACCAGTCAGTGGTAATA- M. kansasii 
-GTGACGCAGGAAGG@AGCCGTACC§6tCAGTGGTAATA- M- smegmatis 

■ 1 1 — — , ^ 

1730 1740 1750 1760 

-CTGGGGCAAGCC CGTAG—AK AGCGATAGGCAAATCCGT M. avium 
-CTGGGGCAAGCCCGTAG — AGAGCGATAGGCAAATCCGT M. paratuberc . 

-ctggggctvagc^tagIs^gagcgataggcaaatccgt M. tuberculosis 

-C@GGGG@AAgcCgjGTAG 3MGAG@GATAGGCAAATCCGT M . phle i 

-ctgg@gcaagcccgtagsg6gagcgataggcaaatccgt M. leprae 

"CTGGGGCAAGCcEIgTAG 3G AGAGCGATAGGCAAATCCGT M . g as t r i 
-CTGGGGCAAGCC R GTAG 3G AG AGCG ATAGGCAAATCCGT M . Jcans as i i 
- C@GG@G@AAGCCgGTAG^GG|AGg^GATAGG|^\AATCCGT M . smegmat i s 
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1810 1820 1830 1840 

2051 CG-AATTCGGTGATCCTCTGCTGCCAAGAAAAGCCTCTA- M. avium 
2051 CG-AATTCGGTGATCCTCTGCTGCCAAGAAAAGCCTCTA- M.paratuberc . 
2751 CG-AATTCGGTGATCCTCTGCTGCCAAGAAAAGCCTCTA- M- tuberculosi 
2141 CG-7UITTCGGTGATCCT§TGCTG@c|aGAAMGCCTCTA-^ M.phlei 
2074 CG-AATTCGGT@RgcCTCTGCTGCCAAGAAAAGCCTCTA- M. leprae 
1834 CG-AATTCGGTGATCCTCTGCTGCCAAGAAAA6CCTCTA- M.gastri 
1777 CG-AATTCGGTGATCCTCTGCT6CCAAGAAAAGCCTCTA- M.kansasii 
4228 CG-AATTCGGTGATCCI§TGCTGCC|aGAAAAGCCTCTA- M.smegmatis 



2089 
2089 
2789 
2179 
2112 
1872 
1815 
4266 



1850 1860 1870 1880 

GCGAGcfeSaSaSSPGGCCCGTACCCCAAACCAACACAGGT M, 
GCGAGCACATACACgGCCCGTACCCCAAACCAACACAGGT M . 
GCGAGCACA@ACACGGCCCGTACCCCAAACCgACACAGGT M, 
GCgAGcBCATACACGGCCCGTACCCCAAACCAACACAGGT M, 
GCGAGCAgng^|§GX3GCCCGTACCCCAAACCliACACAGGT M. 
GCGAGCACA C ACACGGCCCGTACCCCAAACC 3 ACACAGG M . 
GCGAGCACAjgACACGGCCCGTACCCCAAACq^CACAGGT M, 
GCGAGgACATACACGGCCCGTACCCCAAACCAACACAGGT M. 



avium 

paratuberc. 

tuberculosis 

phlei 

leprae 

gastri 

kansasii 

smegmatis 



1970 1980 1990 2000 

2208 AGGGGdc'cCGGAATACrxSTGAanannnTTrarrsrarprifzfzTirzn M- avium 
2208 AGGGGGCCCGGAATACCGTGTUICACCCTTGCGGTGGGAGC M.paratuberc - 
2908 AGGGGgHcCGGAATA@CGTGAACACCCTTGCGGTG6GAGC M. tuberculosis 
2298 AGGGGGnCCg^|TACCGTGA|G§c|CTTGCGG|GG^GC M.phlei 
2231 AGGGGGgCCGGAATAgCGTGAACACCCTTGCGGTGGGAGC M. leprae 
^^^^ _ M. gastri 

1934 AGGGGGHcCGGAATACCGTGAACACCCTTGCGGTGGGAGC M.kansasii 
4385 AGGGGG^C^AT^GTG@ft^C@rTaCGG |555^ GC M.smegmatis 



2248 

2248 
2948 
2338 
2271 
1910 
1974 
4425 



2010 2020 2030 2040 

GGGATjlgfiggbGCAGAAACCAGT dGGTA GCGACT-GTTT M. 
GGGATTCGGCCGCAGAAACCAGTGGGTAGCGACT-GTTTA M , 
GGGA10bGGfflCGCAGAAACCAGTGHGBA6CGACT--GTTTA M. 
GGG§Gi|GGraGCA@A7U^CCAGTGRGSAGCGACT-6TTTA M. 
GGGATgCGGjlJcGCAGAgACCAGTGyGEpGCGACT-GTTTA M. 

M, 

GGGATTCGGffibGCAGAAACCAGTGHGHAGCGACT@GTTTO M. 
G@GAgTgGGE^CA@AAACCAGTGy6Ep^GCGACT--GTTTA M. 
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2367 
2367 
3067 
2457 
2390 
1910 
2094 
4544 



2130 2140 2150 2160 

CCGTTAACCqSH— 7VAGGGTGAAGCGGAGAATTTAAGCCC M. 
CCGTTAACCCGT — AAGGGTGAAGCGGAGAATTTARGCCC M , 



eCGTTAACCCG 
CCGTTJUVCC ^ 
C^TTAACCCG 



-AR GGGTGAAGCGGAGARTTTAAGCCC M, 
rCG|GGGTGAAGCG6AGAATTTAAGCCC M. 



^AAGGGTGAAGCGGAGAATTTAAGCCC M 



CCGTTAACCCg H — W^ GGGTGAAGCGGAGAATTTAAGCCC M 
CCGTTAACCC FCTTGqi GGGTGAAGCGGAGAATTTAAGCCC M 
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2250 



2260 



2270 



2280 



2485 
2485 
3185 
2577 
2508 
1910 
2212 
4663 



GTAACGACTTqcCAApTGTCTCAACCATAGACTCGGCGAA M< 
GTAACGACTTCCCAACTGTCTCAACCATAGACTCGGCGAA M. 
GTAACGACTTCfflCAACTGTCTCAACCATAGACTCGGCGAA M, 
GTAACGACTTC I CAACTGTCTCAACCATAGACTCGGCGAA M , 
GTAACGACTTCgjcAACTGTCTCAACCATAGACTCGGCGAA M, 

GTAACGACTTCfflCAACTGTCTCAACCATAGACTCGGCGAA M, 
GTAACGACTTCyCAACTGTCTC7\AC^TAGACTC6GCGAA M. 
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2605 
2605 
3305 
2697 
2628 
1910 
2332 
4782 



2370 2380 2390 2400 

GTTCGGTACGGTTTGTGTAGGATAGGTGGGAGACO trTGAA M . avium 
GTTCGGTACGGTTTGTGTAGGATAGGTGGGAGACTTTGAA M. paratuberc . 
GTTCGGTACGGTTTGTGTAGGATAGGTGGGAGACtBtGAA M. tuberculosis 
GQTCGgTACGGTTTGTGTAGGATAGGTGGGAGACTSTGAA M- phlei 
GTTCGGTlCGGTTTGTGTAGGATAGGTGGGAGACTgTGAA M. leprae 

M. gastri 

GTTCGGTACGGTTTGTGTAGGATAGGTGGGAGACiarGAA M. kansasii 
GgTCG§rACGGTTTGTGTAGGATAGGTGGGAGACTyTGAA M. smegmatis 
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2410 



2420 



2430 



2440 



2645 GCACftBACGCcaBTTfpp;^^^ j^rrnrsq^nfST^rrCTTOn ^ 
393 ^^'^CAGACGCCAGTTTGT^GGAGTCGTTGTTGAAATACC M, 
2645 GCACAGACGCCAGTTTGTGTGGA6TCGTTGTTGAAATACC M, 

3345 HcCt^acgccagttSgBgBggagtc^ m! 
284 e|c::tcgacgccagtt3gggyggagtcgttgttgaaatacc m. 
2737 gclgggacgccagttcgsgtggagtcgttgttgaaatacc m, 

2668 @CgTgGACGC§AGTTgG|gGTG6A6TCG M. 

1^10 „ , ^ _ ^ M. 

2372 gCCTCMACGCCAGT': 
4822 GGTi|ca1c|aCGCCAGT|E3|TG|( 



JTGGAGTCGTTGTTGAAATACC M, 
5TGGAGTCGTTGTT6AAATACC M. 
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2685 

433 

2685 

3385 

324 

2777 

2708 

1910 

2412 

4862 



2450 



2460 



2470 



2480 



ACTCTGATCGTATTGGACACCTAACGTCGAACC fcT-TAE c M. 
ACTCTGATCGTATTGGACACCTAACGTCGAACCCT-TATC M . 
ACTCTGATCGTATTGGACACCTAAGGTCGAACCCT- TATC M . 
ACTCTGATCGTATTGGBcAfflCTAAcBrCGAACCCTlsjik M. 
ACTCTGATCGTATTGG G CM CTAAC C TCGAACCCT Sfl ATC M . 
ACTCtGATCGTATTGG CTAAC C TCGgACCjsiT 3G ATC M . 
ACTCTGAT@GTATTGgACAgCTAACgrCGAACCgi^ M. 

M. 

actctgatcgtattggacacctaacgtcgaaccctIg^tc m . 

ACTCTGATCGTATTGGgc|^TAAC|TCG|ACCgT^ATC M. 
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2924 
2924 
3625 
3017 
2948 
1910 
2652 
5102 



2690 2700 2710 2720 

ggtgtc@::tcaacggataaaaggtaccccggggataacI35 m. avium 

ggtgtcactcaacggataaaaggtaccccggggataac'^ 

ggtgtcBctcaacggataaaaggtaccccggggataac 

GGTGTC G CTCAACGGATAAAAGGTACCCCGGGGATAAC 

ggtgtcyctcaacggataaaaggtaccccggggataac 
ggtgtcSctcaacggataaaaggtaccccggggataac 

GGTGTCgCTCAACGGATATWVGGTACCCCGGGGATAAC 



M. paratuberc. 
M. tuberculosis 
M. phlei 
M. leprae 
M. gastri 
M. kansasii 
M. smegmatis 



2730 



2740 



2750 



2964 
2964 
3665 
3057 
2988 
1910 
2692 
5142 



2760 



GCTGATCTTCCCCAAGAGTCCATATCGACGGGATGGTTTG M . 
GCTGATCTTCCCCAAGAGTCCATATCGACGGGATGGTTTG M . 
GCTGATCTTCCCCAAGAGTCCATATCGACGGGATGGTTTG M , 
GCTGATCTTCCCCAAGAGTCCATATCGACGGGATGGTTTG M , 
GCTGATCTTCCCCAAG AGTCCATATCGACGGGATGGTTTG M . 

M. 

GCTGATCTTCCCCAAGAGTCCATATCGACGGGATGGTTTG M . 
GCTGATCTTCCCCAAGAGTCCATATCGACGGGATGGTTTG M , 
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2770 2780 2790 2800 

3004 GCACCTCGATGTCGGCTCGTCGCATCCTGGGGCTGGAGCA M. avium 
3004 GCACCTCGATGTCGGCTCGTCGCATCCTGGGGCTGGAGCA M.paratuberc . 
3705 GCACCTCGATGTCGGCTCGTCGCATCCTGG6GCTGGAGCA M. tuberculosis 
3097 GCACCTCGATGTCGGCTCGTCGCATCCT6GGGCTGGAGCA M.phlei 
3028 GCACCTCGATGTCGGCTCGTCGCATCCTGGGGCTG^GCA M. leprae 
1^10 M-gastri 
2732 GCACCTCGATGTCGGCtCGTCGCATCCTGGGGCTGGAGCA M.kansasii 
5182 GCACCTC6ATGTCGGCTCGTCGCATCCTGGGGCTGGAGCA M.smegmatis 

1 ' 1 — '• , — 

2810 2820 2830 2840 



3044 


GGTCCCATl 


3044 


GGTCCCAB 


3745 


GGTCCCAB 


3137 


GGTCCCAA 


3068 


GGTCCCAft 


1910 




2772 


GGTCCCAA 


5222 


GGTCCCAA 



^^jXTVj*;Tl:^^TtoTT^ViUW^-ATTAAAGCGGCAC M - av 1 um 
5GTTGGGCTGTTCGCCC-ATTAAAGCGGCAC M . par atube rc . 
ipGTTGGGCTGTTCGCCC-ATTAAAGCGGCAC M- tuberculosis 
feGTTGGGCTGTTCGCCC-ATTAAAGCGGCAC M - phle i 
JGGTTGGGCTGTTCGCCC-ATTAAAGCGGCAC M. leprae 

M.gastri 



3050 



3283 CAAGATCAGGTTT- 
638 CAAGATCAGGTTT- 
3283 CAAGATCAGGTTT- 
3984 CTU^GATCAGGtTT- 
570 CAAGATCAGGTTT- 
3376 CAAGAgCAGGgTT- 
3307 CAA 
1910 

3011 caagatcaggBtt- 

5462 CAAGAgCAGGgrT- 



3060 3070 3080 

-CTCACC dTTTTAGAE GGATAAGnc:r:r: M. 
-CTCACCCTTTTAGAGGGATAAGGCCC M. 
-CTCACCCTTTTAGAGGGATAAGGCCC M. 

GGATAAGGCCC M. 
GGATAAGGCCC M. 
^^AGGGATAAGGCCC M. 

M. 
M. 



CTCACC 
•CTCACO 
■CTCACCe 




CTCACCi 
CTCACCC 




l^GGATAAGGCCC M. 
AGGGATAAGGCCC M. 
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3322 

677 

3322 

4023 

609 

3415 

3309 

1910 

3050 

5501 



3090 



3100 



3110 



3120 



CCCGC- 
CCCGC- 
CCCGC- 
CCCGC- 
CCCGC- 
CCCGC- 



AGACCACGG mTTGATAGGCp AGACCTGGAAgnT M. 

agaccacggg0tQgataggccagacctggaagct M. 
agaHcacgggattgataggccagacctggaagct m. 



-AGARCACGG 
AGAglcACGG 
AGACCACGGGA 




CCCGC- 
CCCGC- 



.TAGGICAGACCTGG7XA6CT M. 
, TAGGgCAGACCTGGAAGCT M- 
ATAG@CCAGACCTG@^GC§ M. 

M- 
M, 

-AGft@CACGGGgr@GATAGGCCAGACCTGGAAGCT M. 
-AGACCACGGGATTGATAGgcCAGACCTGGAAGCg M, 
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107 

59 

107 

70 

70 

209 

120 

69 

70 

104 

64 



130 140 150 160 

GAGTARCACGTGGdcA^TCTGCCCTGCRCTTC-RRflTVTAn m, 
GAGTAACACGTGGGCAATCTGCCCTGCACTTC- GGGATAA M , 
GAGTAACACGTGGGCAATCOgcCCTGCACTTC- GGGATAA M . 
GAGTAACACGTGGGCAATCTGCCCTGCACTTC-6GGATAA M. 
GAGTAACACGTGGGff^AtCTGCCCTGCACTTC- GGGATAA M , 
GAGTTWVCACGTGGG rGJATCTGCCCTGCACTTC- GGGATAA M . 
GAGTAACACGTGGGg^TCTGCCCTGCACTTC§GGGATAA M . 
GAGTAACACGTGGGCAATCTGCCCTGCACKSc-GGGATAA M. 
GAGT!AAC7lCGTGGGCAATCTGCCCTGCACftqC--GGGATAA M, 
GAGTAACACGTGGG^bVATCTGCCCTGCAC P^C~GGGATAA M. 
GAGTAACACGTGGG^ATCTGCCCTGCACTTC-GGGATAA M . 
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424 
376 
424 
387 
389 
528 
439 
386 
387 
420 
381 



450 460 470 480 

AAACCTCTTTCACCATCGACGAAGGTCCGG qTTTTC TCGG M 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTTTTCTCGG M 
AAACCTCTTTCACCATCGACGAAGGTC CGGGTT TTCTflGG 

AAACCTCTTTCACCATCGACGAAGG§|C^5!E 



GTEG 



AAACCTCTTTCACCATCGAGGAAGGTCCGGGTT : TCTCGG 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTTlqTCTCGG 
AAACCTCTTTCACCATCGACGAAGGTCgGGGi 
AAACCTCTTTCACCATCGACGAAGGTCCGGG' 
AAACCTCTtTCACCATCGACGAAGGTCCGGGT' 
AAACCTCTTTCACCATCGACGAAGGTCCGGGTTTTCTCGG M 
AAACCTCTTTCACCATCGACGMGGTgCGGGTTTTCTCGG M 



M 
M 

M 

M, 

TTCTCGG M, 
CTCGG M, 
CTCGG M, 
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490 



500 



510 



520 



429 ATTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTG M. 

568 ATTGACGGTAGGTGGAGAAGAAGCACGGGCCAACTACGTG M. 

464 ATTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTG M. 

416 ATTGACGGTAGGTGGAGAAGAAGCA CCGGCCAA CTACGTG M, 

464 ATTGACGGTAGGTGGAGAAGJ\AGCAC | l ACTACGTG M. 

424 gTTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTG M, 

479 ATTGACGGTAGGTGGAGAAGAAGCACCGGCC7VACTACGTG M. 

426 ATTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTG M. 

427 ATTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTG M. 
460 Is^GACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTG M. 
421 ATTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTG M. 
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113Q 1140 1150 1160 

1104 TCTCATGTTGCCRGaGGGTRATCcnflGf^nnnTrffTaiiraTir! m. 
1056 TCTCATGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M. 
1098 TCTCATGTTGCCAGCGGGTAATGCgGGGGACTCGTGAGAG M. 
1064 TCTCAtGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M. 
1069 TCTCATGTTGCCAGCfiaGTAATG|M|GGGGACTCGTGAGAG M. 
1208 tCTCATGTTGCCAGCftCGTAATGSIGGGGACTCGTGAGAG M. 
1119 TCTCATGTTGCCAGC^gGTAATGlSgGGGGACTC^^ M. 

1066 TCTCAtGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M. 

1067 TCTCATGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M, 
1100 TCTCATGTTGCCAGCGGGTAATGCCGGGGACTCGTGAGAG M, 
1061 TCTCATGTTGCCAGC^L^TAATG^ljGGGGACTCGTGAGAG M. 
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1290 1300 1310 1320 

1264 CGAATC |c'TTTTAfi AGCCGGACTCAGTTCGGAT|g3GGGTCT M, 
1216 CGAATCCTTTTAAAGCCGG@CTCAGTTCGGATT6GGGTCT M, 
1258 CGAATCCTtTTAAAGCCGGACTCAGTTCGGATTGGGGTCT M. 
1224 CGAATCCTTTTAAAGCCGGfflCTCAGTTCGGATSGGGGTCT M, 
1229 CGT^TCCTTfinAAAGCCGGICTCAGTTCGGATCGGGGTCT M. 
1368 CGAATCCTlK-jAAAGCCGG I CTCAGTTCGGAT C GGGGTCT M. 
1279 CGAATCCTTTTAAAGCCGG I CTCAGTTCGGAT C GGGGTCT M. 

1226 CGAATCCTTTTAAAGCCGG I CTCAGTTCGGAT C GGGGTCT M. 

1227 CGAATCCTTTTAAAGCCGG 1 CTCAGTTCGGAT C GGGGTCT M, 
1260 CGAATCCTTTTAAAGCCGG I CTCAGTTCGGAT C GGGGTCT M. 
1221 CGAATCCTTTBAAAGCCGGgjcTCAGTTCGGATC GGGGTCT M. 
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1330 1340 1350 1360 

1304 GCAACTCGACCCCEIrGAAGTCGGAGTCGCTAGTAATCGCA M, 
1256 GCAACTCGACCCCATGAAGTCGGAGTCGCTAGTAATCGCA M, 
1298 GCAACT@GACCC^TGAAGTCGGAGTCGCTAGTAATCGCA M. 
1264 GGAACTCGACCCCilGAAGTCGGAGTCGCTAGTAATCGCA M. 
1268 GCAACtCGACCCCSTGAAGTCGGAGTCGCTAGTAATCGCA M, 
1407 GCAACTCGACCCC3TGAAGTCGGAGTCGCTAGTAATCGCA M, 
1319 GCAACTCGACCCCSTGAAGTCGGAGTCGCTAGTAATCGCA M, 

1266 GCAACTCGACCCC3TGAAGTCGGAGTCGCTAGTAATCGCA M. 

1267 GCAACTCGACCCC3TGAAGTCGGAGTCGCTAGTAATCGCA M. 
1300 GCAACTCGACCCC3TG7UVGTCGGAGTCGCTAGTAATCGCA M. 
1260 GCAACTCGACCCC3TGAAGTCGGAGTCGCTAGTAATCGCA M. 
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